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ABSTRACT 


This  paper  reports  some  further  analyses  and  applications  of  Morrison's 
model  of  the  predictive  relationship  between  measures  of  intentions  and  sub- 
sequent purchasing  behavior.  A  review  of  published  studies  bearing  on  the 
threats  to  predictive  validity  of  intention  scales  represented  in  Morrison's 
model  is  presented.  Findings  from  a  test-retest  study  of  intention  ratings 
for  concept  stimuli  are  shown  to  be  consistent  with  the  levels  of  reliability 
expected  under  the  model's  assumptions  of  beta  binomial  distributed  scores. 
Evidence  of  the  predictive  validity  of  intention  measures  is  found  in  a  re- 
analysis  of  several  sets  of  relevant  data  but  a  different  form  of  predictive 
relationship  is  shown  to  hold  for  generic  durable  goods  as  compared  to  brand- 
ed packaged  goods.  Whereas  a  linear  relationship  is  supported  in  the  case  of 
durable  goods,  the  presence  of  a  threshold  phenomenon  in  the  branded  packaged 
goods  data  suggests  the  use  of  apiecewise  linear  model.  There  is  reason  to 
believe  that  the  sources  of  systematic  error  present  in  intentions  ratings 
are  different  for  these  two  types  of  purchases. 


INTRODUCTION 

Purchase  intention  is  one  of  a  very  small  set  of  variables  that  find 
routine  application  in  consumer  research  investigations  undertaken  for  a 
variety  of  different  purposes  (e.g.,  new  product  concept  and  copy  tests; 
segmentation  and  tracking  studies)  and  covering  a  broad  range  of  products 
and  services.  The  collection  of  purchase  intention  data  has  become  fairly 
standardized  in  the  sense  that  one  or  the  other  of  two  basic  types  of  in- 
struments appear  to  be  employed  in  much  of  this  work,  either  a  five  point 
intention  scale  (definitely  will  buy  =  5;  definitely  will  not  buy  =  1 )  or 
the  eleven  point  purchase  probability  scale  developed  by  Juster  (1966). 
Despite  widespread  utilization  of  these  scales,  reservations  and  criticisms 
have  been  expressed  frequently  about  this  reliance  on  intention  measures  with 
the  issue  of  predictive  ability  being  the  paramount  concern.  Do  intentions 
expressed  at  a  particular  point  in  time  relate  to  future  purchase  events? 

The  published  evidence  available  on  this  matter  is  much  more  abundant 
for  purchasing  behavior  at  the  level  of  generic  types  of  durable  goods  than 
for  the  purchasing  of  specific  brands  within  product  categories--either  durable 
or  non-durable.  Numerous  studies  conducted  by  economists  have  found  statis- 
tically significant  relationships  between  prior  intentions  and  subsequent 
purchasing  of  generic  durables  at  both  the  individual  household  and  aggregate 
levels  but  the  forecasting  value  of  intentions  data  has  been  disappointing 
and  controversial  (McNeil  1974;  Adams  1974;  and  Juster  1974).  However, 
turning  to  the  matter  of  intentions  to  buy  particular  brands,  one  finds  only 
bits  and  pieces  of  data  in  the  marketing  research  literature  relevant  to  the 
question  of  predictive  validity.  On  a  few  occasions,  intention  studies 
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have  been  followed  up  with  efforts  to  obtain  reports  of  subsequent  pur- 
chasing behavior  from  the  same  sample  of  respondents.  While  positive  asso- 
ciations between  intentions  and  purchases  have  generally  been  observed  in 
such  data  (Gormley  1974;  Laroche  and  Howard  1980;  Penny,  Hunt,  and  Twyman 
1972;  Rothman  1964;  Smith,  Davenport,  and  Parker  1963;  Tauber  1975;  Taylor, 
Houlahan,  and  Gabriel  1975;  Twyman  1973;  and  Wells  1961),  the  strength  of 
the  relationship  uncovered  in  these  analyses  has  not  been  viewed  as  sufficiently 
marked  and  consistent  to  allay  the  basic  concern  about  the  predictive  validity  of 
intention  scales  noted  above  and  shared  by  many  in  the  marketing  research  community. 
Another  body  of  research  has  been  accumulating  on  the  psychometric  prop- 
erties of  intention  scales.  Published  studies  have  reported  evidence  bearing 
on  the  vulnerability  of  these  intention  measures  to  response  style  biases 
(Clancy  and  Garsen,  1970),  their  ability  to  discriminate  among  alternative 
brands  or  product  concepts  (Gold  and  Sal  kind, 1974;  Haley  and  Case,  1979;  and 
Ptacek  and  Ross,  1980),  and  their  sensitivity  to  the  effects  of  advertising 
exposure  (Axelrod,  1968).  The  presence  of  substantial  components  of  random 
and/or  systematic  errors  in  intention  measures  will,  of  course,  tend  to  affect 
adversely  both  their  discriminant  and  predictive  abilities.  What  past  re- 
search in  this  area  has  lacked  is  an  integrating  framework  which  would  allow 
assessments  of  the  discriminant  and  predictive  validity  of  intention  scales 
to  take  explicit  account  of  the  major  sources  of  fallibility  present  in  such 
measurements. 

Recently,  Morrison  (1979)  has  made  an  important  contribution  to  enhanc- 
ing understanding  of  the  quality  of  intention  measures  and  to  improving  their 
practical  usefulness.  Morrison  has  proposed  a  model  of  the  linkage  between 
intentions  and  purchasing  which,  among  other  things,  represents  several  threats 
to  the  predictive  validity  of  intentions  measures  and  lends  itself  to  statis- 
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tical  estimation  and  testing.  Thus  for  the  first  time,  a  formal  model  has 
been  specified  for  intention  scaling  similar  in  spirit  to  the  psychometric 
models  which  are  the  foundation  of  psychological  measurement. 

The  purpose  of  the  present  paper  is  to  report  some  further  analyses  and 
empirical  applications  of  Morrison's  model.  We  begin  with  a  summary  of  the 
model,  emphasizing  its  psychometric  interpretation  and  correcting  a  minor 
error  in  one  of  the  original  equations.  In  the  second  section  of  the  paper, 
we  review  the  evidence  available  relating  to  two  threats  to  the  predictive 
validity  of  intention  measures  represented  in  Morrison's  model:  random  measure- 
ment error  or  unreliability  and  systematic  error  or  bias.  Some  empirical  esti- 
mates of  the  reliability  of  a  seven  point  intention  scale  are  reported  from  a 
small  scale  test-retest  study  of  new  product  concepts.  An  examination  of  the 
evidence  on  bias  draws  attention  to  possible  differences  in  the  properties  of 
intention  scales  according  to  whether  they  are  applied  to  generic  durable 
goods,  a  brand  of  frequently  purchased  consumer  products,  or  concept  test 
stimuli. 

The  third  section  of  the  paper  discusses  the  results  obtained  by  apply- 
ing Morrison's  model  to  two  different  classes  of  intention-purchase  behavior 
data.  One  body  of  data  pertains  to  the  purchasing  of  generic  types  of  dur- 
able goods  in  the  United  Kingdom.  The  intention  measure  is  the  same  eleven 
point  purchase  probability  scale  used  in  the  well-known  Juster  (1966)  study 
which  Morrison  reanalyzed.  The  second  set  of  data  examined  involves  brand 
purchasing  of  packaged  goods  and  here  the  instrument  used  to  measure  inten- 
tions was  either  a  five  point  or  a  seven  point  scale.  Thus  we  investigate 
the  applicability  of  Morrison's  model  to  a  broader  range  of  situations  than 
those  he  examined  for  illustrative  purposes  in  his  original  paper.  Maximum 
likelihood  methods  are  used  for  parameter  estimation  and  likelihood  ratio 


tests  are  conducted  to  compare  the  goodness-of-fit  of  two  alternative  speci- 
fications of  the  relationship  between  the  measures  of  intentions  and  purchasing: 
(a)  the  linear  relation  used  by  Morrison,  and  (b)  a  piecewise  linear  model 
suggested  by  empirical  findings  reported  by  Haley  and  Case  (1979)  and  Laroche 
and  Howard  (1980)  with  reference  to  packaged  goods  and  by  the  "top  box"  scor- 
ing rules  frequently  used  by  marketing  research  practitioners  in  analyzing  five 
point  intention  scale  data  (Gold  and  Salkind  1974;  and  Gruber  1970).  We 
conclude  with  a  discussion  of  the  implications  of  our  findings  for  further 
testing  of  the  model  and  its  applications. 

MODEL  STRUCTURE 

Figure  1  summarizes  the  overall  structure  of  Morrison's  model.  The 
principal  aim  is  to  model  the  "predictive"  relationship  between  the  two 
observables:  an  individual  consumer's  intentions  measured  at  a  particular 
point  in  time  (I^)  and  his/her  probability  of  purchasing  (P  )  as  manifested 
in  the  subsequent  purchase  events  that  occur  and  are  reported  after  some 
time  interval  (a).  Intentions  are  measured  with  error  (e)  and  hence  expressed 
intentions  (I  )  are  an  imperfect  indication  of  true  intention  (I  )  which  may 
change  over  the  time  interval   and  have  a  different  unobserved  true  value 
Cl!)  due  to  a  host  of  possible  influences  (e.g.,  unexpected  change  in  income, 
later  consumption  experience,  etc.)  all  of  which  are  summarized  in  a  single 
change  parameter  (p).  Intentions  may  also  be  subject  to  systematic  sources 
of  error  (response  style  biases  mentioned  previously,  the  effects  on  promo- 
tion and  the  like)  and  hence  Morrison  proposes  to  adjust  for  such  biases  via 
a  parameter,  b.  Thus  the  true  predictive  validity  of  intentions  may  be  ob- 
scured in  the  observed  relation  with  purchasing  due  to  the  presence  of  three 
types  of  threats:  (a)  random  measurement  error  in  expressed  intentions,  (b) 
instability  or  change,  and  (c)  bias  in  true  intentions. 
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INSERT  FIGURE  1  HERE 

Morrison  models  random  measurement  error  at  the  individual  level  by 
assuming  a  respondent's  expressed  intention  (I)  is  a  random  variable  bi- 
nomially  distributed  with  parameters  I.  (true  intention)  and  n,  where  n+1 
is  the  number  of  points  or  alternative  response  categories  presented  in  the 
intention  scale.  True  intentions  are  assumed  to  be  beta  distributed  in  the 
population  and  then  the  aggregate  unconditional  distribution  of  expressed 
intentions  across  consumers  is  beta  binomial.  A  key  feature  of  the  beta 
binomial  formulation  stressed  by  Morrison  is  that  it  leads  to  a  linear 
relationship  between  the  expected  value  of  true  intention  and  the  expressed 
or  measured  intention.  Below  we  discuss  some  further  interpretations  of  the 
beta  binomial  assumption  and  investigate  its  implications  regarding  scale 
reliability. 
The  Reliability  of  Beta  Binomial  Intention  Scales 

Respondents  are  asked  to  express  their  purchase  intentions  by  checking 
one  of  the  n+1  labelled  response  categories  which  the  instrument  provides. 
The  response  alternatives  are  ordered  to  represent  a  monotonic  increasing  (or 
decreasing)  level  of  intention  and  responses  are  assigned  an  arbitrary  inte- 
ger score  X  =  0,  1 ,  2,... ,  n  which  reflects  this  ordering. 

Modeling  a  respondent's  behavior  in  selecting  a  response  alternative 
as^  i_f  it  were  the  outcome  of  a  binomial  process  is  a  strong  assumption 
which  Morrison  (1979,  p. 67)  points  out  is  "untestable"  but  useful  in  that  it 
provides  a  "reasonably  specific  representation"  of  the  conventional  measure- 
ment model  where  the  observed  score  is  taken  as  a  sum  of  a  true  value  and  a 
random  error  component.  Morrison  hypothesizes  that  "an  individual  with  true 
intention  I.  responds  1  or  0  in  an  independent  fashion  to  each  point  on  the 
scale  with  probabilities  I.  and  l-I^,  respectively ,"  and  hence  the  observed 
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intention  (x)  is  "the  sum  of  these  0,  1  responses"  (p. 67).  A  somewhat  diff- 
erent way  to  conceive  of  the  respondent's  behavior  is  to  presume  that  he/ 
she  regards  the  specified  response  alternatives  as  forming  an  interval  scale 
(where  I^  =  x/n)  and  then  selects  the  particular  category  whose  value  is 
closest  to  his/her  particular  true  intention  I^.  Uncertainty  in  choosing  the 
appropriate  discrete  response  category,  momentary  fluctuations  in  attention, 
and  various  other  kinds  of  transitory  factors  could  lead  a  respondent  to 
check  different  points  on  the  scale  if  the  task  were  repeated  and  the  result- 
ing frequency  of  response  is  postulated  to  be  binomial.  Such  a  line  of 
reasoning  is  akin  to  Thurstone's  notion  of  a  "discriminal  dispersion"  used  in 
formulating  his  laws  of  comparative  and  categorical  judgement  which  assume  a 
normal  distribution  of  responses  (Torgerson  1958,  pp.  156-158).  A  useful 
way  to  illuminate  the  meaning  of  the  binomial  assumption  here  is  to  examine 
the  curves  describing  the  probability  of  endorsing  the  various  response  cate- 
gories as  a  function  of  the  individual's  underlying  true  intention.  The  family 
of  such  curves,  known  as  the  trace  lines  or  operating  characteristics  (Torgerson 
1958,  p.  362),  are  shown  in  Figure  2  for  a  five  point  scale.  Thus,  a  respon- 
dent with  true  intention  (I^)  of  .75  would  be  most  likely  to  check  4  on  a 
five  point  scale  but  would  have  a  non-zero  probability  of  choosing  any  of 
the  other  points  as  shown  in  Figure  2. 


INSERT  FIGURE  2  HERE 


To  obtain  an  expression  of  the  reliability  of  an  n+1  point  intention 
scale  consistent  with  Morrison's  beta  binomial  assumptions,  we  begin  with 
the  usual  measurement  model : 
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where  I  and  I  represent  observed  and  true  intentions,  respectively,  and  e  is  the 
random  measurement  error.  For  fixed  true  score,  I. ,  the  observed  score  (I  )  is  dis- 
tributed binomial  with: 

E[iJi,J  =  rt.  (2) 

and, 

^t(^-^t^ 

v-[i.l^tJ=^ir^'  (3) 

where  the  expectation  and  variance  operators  are  over  the  respondent  population. 
Equations  (2)  and  (3)  imply  that,  for  a  fixed  true  score  I^,  the  measurement  error 
is  unbiased,  that  is: 

E[e|I^J  =  0  .  (4) 

and 

1,(1-1.) 
Var[e|I^]  =    n    •  (^^ 

It  is  worth  noting  that  the  conditional  distribution  of  the  measurement  error, 
e,  is  not  independent  of  the  true  score,  I.  .  This  is  to  be  expected  since  the 
model  contains  bounded  observed  scores  (0  ^  I  ^  1)  and  unbiased  measurement  errors. 

Recall  that  reliability  of  a  scale  is  defined  as  the  proportion  of  the 
variance  in  observed  scores  accounted  for  by  the  variance  in  true  scores  (Lord  and 
Novick  1968,  p.  61).  For  the  model  set  forth  in  equation  (1),  the  reliability  of  the 


intention  scale  is  given  by: 
Var[I^J 


R*  =  v^^n^  .  (6) 


The  variance  of  I  which  follows  the  beta-binomial  distribution  is; 


Var[l^]  =  Var[I^]  +  E 


Var[e|l^] 


17) 


The  first  term  on  the  right  hand  side  of  (7)  represents  the  variance  of  the  conditional 
means  and  the  second  term  is  the  mean  of  the  conditional  variances.  Using  equation 
(5)  and  carrying  out  the  expectation  and  variance  opertations  we  find  the  two  right- 
hand  side  terms  in  equation  (7)  simplify  to  (see  Appendix  A): 


Var[I^]  =  a3/(a+e)^(a+e+1)  ,  (8) 

[e  Var[e|I^  =  aB/n(a+B){a+e+l)   .  (9) 

Adding  the  two  variance  terms  in  equations  (8)  and  (9)  yields: 

Var[I^J  =  a3(a+e+n)/n{a+3)^(a+3+1)   .  (10) 

Therefore,  the  reliability  of  the  beta  binomial  distributed  intention  scale  is 
given  by: 

R*  =    a3/(a+3)^(a+3+1) 


2" 

a3(a+3+n)/n(a+3)  la+3+l) 


I  a+3+ny 


(11) 


The  reliability  expression  for  the  beta  binomial  intention  scale  in  equa- 


tion (llj  has  several  interesting  features.^  First,  reliability  of  the  beta  binomial 
distrubuted  intention  scale  will  differ  across  stimuli  according  to  variations  in  the 
sum,  a+3.  Reliability  will  increase  as  the  term  a+3  approaches  zero.  This  is  in- 
tuitively appealing  since  when  the  term  a+3->0,  the  beta  distribution  has  virtually 
all  of  its  mass  concentrated  near  0  or  1 .  Then  for  most  people,  the  measurement 
error  in  equation  (5)  would  be  near  zero  and  the  intention  scale  would  be  highly 
reliable.  At  the  other  extreme,  the  scale  will  become  completely  unreliable  as  the 
term  a+3  becomes  large.  As  Morrison  (1979,  p. 67)  notes,  this  is  consistent  with 
the  fact  that  the  beta  binomial  distribution  becomes  a  "spike"  at  its  mean 
a/U+e)  when  the  torm  a+&-^.     Under  such  circumstances,  all  respondents  would 
have  the  same  true  intention  so  that  the  measurement  error  present  would  dominate 
the  variance  in  the  true  scores. 

A  second  important  property  is  that  reliability  of  a  beta  binomial  intention 
scale  is  expected  to  improve  when  the  number  of  scale  points  is  increased.  Thus, 
given  a  =  3  =  0.5,  the  reliability  of  a  five  point  scale  (n  =  4)  is  expected  to 
increase  from  4/5  =  0.80  to  10/11  =  0.91  when  the  number  of  scale  points  is  raised 
to  eleven  (n  =  10).  This  relation,  {^^)   corresponds  to  the  result  found  in  classical 
test  theory,  the  Spearman-Brown  formula,  which  indicates  that  the  reliability  of 


a   composite  score  increases  monotonically  as  a  test  is  lengthened  by  adding  more 
items  which  are  homogeneouswith  respect  to  the  original  set  in  the  sense  of  pre- 
serving the  pattern  of  inter-item  correlations  (Lord  and  Novick  1968,  pp.  112-224). 
Thus,  in  Morrison's  model  of  a  single  or  one  item  rating  scale,  the  number  of  response 
alternatives  or  levels  it  contains  plays  an  analogous  role  to  that  of  the  number 
of  items  in  the  standard  psychometric  model  of  a  composite  score  derived  from 
multiple  items. 
Relating  Purchase  Intentions  and  Purchase  Behavior 

Citing  the  work  of  Keats  and  Lord  (1962],  Morrison  (1979,  p.  67)  points  out 
that  a  beta  binomial  distributed  scale  will  yield  a  linear  relationship  between 
the  expected  value  of  true  intention,  I^,  and  stated  intention,  I^.  He  notes 
that  this  linear  relation  is  given  by: 

Our  derivation  of  this  relationship  (see  Appendix  B)  leads  to  a  slightly 
different  result: 

Thus  we  find  that  the  equation  (l2)  given  by  Morrison  (!979,  p.  67}  contains  an  error 
and  should  be  replaced  by  the  corrected  version  (13). 

Morrison  (1979,  p.  66}  suggests  a  modification  in  the  beta  binomial  model  to 
account  for  changes  in  true  intentions  of  individual  respondents  during  the  time  frame 
of  interst  due  to  exogenous  factors  such  as  onexpected  changes  in  income.  His 
proposed  modification  does  not  affect  the  linear  nature  of  the  relationship  between 
the  expected  value  of  I  and  I..  Basically,  the  modification  allows  a  change  to 
occur  in  an  individual's  true  intention  with  orobability,  p,  and  given  such  a  change, 
the  individual  is  assigned  a  new  true  intention  I|,  randomly  drawn  from  the  original 
beta  density  function.  The  new  relationship  between  the  expected  value  of  I|.  and 
I  is  given  by  (see  Morrison  1979,  d.  68): 
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'^'-  t'  X-^   a+B    a+B+n     a+6+n   X   ' 

=  pEEl^]  +  (l-p)E[I^|I^]  .  (14) 

where  E[I'  I  ]  denotes  the  expected  value  of  revised  true  intention.  This  modifi- 
cation in  the  beta  binomial  model  leads  to  a  reduction  in  the  slope  of  the  linear 
relationship  between  the  expected  value  of  I^  and  I^. 

As  mentioned  earlier,  intention  measures  are  susceptible  to  systematic  sources 
of  error  due  to  response  style  tendencies,  promotional  effects,  changes  in  the  economy, 
etc.  In  Morrison's  model,  bias  introduced  in  E[I^  |l^]  by  these  systematic  errors  is 
assumed  to  be  constant  across  all  respondents.  The  overall  bias  for  the  samole  of 
respondents  in  equation  (14)  is: 

b  =  T^  -  TT  ,  (15) 

where  7  is  the  mean  of  the  observed  intention  scale  values  and   is  the  overall 
proportion  of  respondents  who  subsequently  purchase.  Morrison  (1979,  p.  68) 
adjusts  for  systematic  errors  by  subtracting  the  overall  bias  b  from  each  E[I(llx^* 

p,  =  E[iiig-b 

-  p«    I  (^-p)    I  n-P)n  J  _^ 

a+B   a+B+n    a+B+n   X 

where  P  is  the  probability  of  purchasing  given  expressed  intention,  I  and  R  is 
X  ^ 


the  reliability  index  defined  in  (11  j.  Hence  we  see  from  the  final  model  (16)  that  the 
predictive  validity  of  stated  ■ 
bility  (R  )  and  biasedness  (b), 


predictive  validity  of  stated  intentions,  I^'s  depends  upon  their  stability  (p),  relia 
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RESEARCH  ON  RANDOM  ANU  SYSTEMATIC  ERRORS  IN  INTENTIONS  SCALES 

In  this  section  we  review  evidence  bearing  on  two  properties  of  intention  scales 
which  represent  potential  threats  to  their  predictive  validity  specified  in  Mor- 
rison's model.  The  first  is  reliability,  which  relates  to  the  random  error 
component  present  in  intention  ratings—a  subject  which  appears  to  have  been  largely 
ignored  in  past  research.  We  present  some  empirical  estimates  of  reliability 
and  compare  them  to  the  values  expected  under  the  assumption  that  the  ratings 
follow  a  beta  binomial  distribution.  Bias  is  the  second  threat  discussed  below 
along  with  the  attendant  problem  of  discriminant  ability.  Consideration  is 
given  to  how  systematic  errors  may  vary  according  to  whether  intentions  are 
measured  with  reference  to  the  purchasing  of  generic  durable  goods  or  branded 
packaged  goods.  Finally  the  issue  of  how  many  response  categories  should  be  used 
in  an  intention  scale  is  examined. 
Reliability 

A  review  of  the  marketing  research  literature  indicates  that  surprisingly 
little  attention  has  been  given  to  the  reliability  of  intention  scales.  The  only 
published  reliability  estimate  for  an  intention  scale  the  present  authors 
were  able  to  uncover  is  that  found  in  Ptacek  and  Ross  (1979).  The  estimates  were 
based  on  data  collected  in  a  concept  testwherein  each  respondent  provided  an 
intention  rating  for  each  of  seven  new  product  stimuli.  The  ANOVA  model  (Ebel 
1951;  Winer  1971,  pp.  283-293)  used  to  estimate  the  variance  components  makes 
the  highly  restrictive  assumption  that  all  respondents  had  the  same  true  intention 
score  for  a  given  concept.  Within  concept  variations  in  ratings  were  attributed  to 
systematic  between-respondent  mean  differences  in  response  style  biases  plus 
random  measurement  error  and  hence  no  allowance  was  made  for  individual  differ- 
ences in  true  intention.  Ihe  reliability 
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coefficients  reported  used  between  concept  mean  differences  rather  than  the 
customary  between  respondent  differences  as  the  true  score  variance  estimator. 
It  can  be  shown  (Winer  1971,  p.  291)  that  the  reliability  of  a  rating  computed 
in  this  manner  is  appoximately  equal  to  the  average  intercorrelation  between 
ratings  of  the  seven  concepts  given  by  pairs  of  respondents.  Thus,  the  small 
values  of  the  reliability  coefficients  reported  by  Ptacek  and  Ross  (.110  for 
the  5  point  intention  scale  and  .176  for  the  eleven  point  Juster  scale)  are 
probably  best  interpreted  as  an  indication  of  respondent  heterogeneity  rather 
than  as  evidence  of  the  low  reliability  of  the  individual  respondents  intention 
ratings. 

The  only  other  data  found  in  the  literature  bearing  on  the  issue  of  scale 
reliability  arise  from  situations  where  both  the  eleven  point  purchase  probability 
scale  and  the  five  point  intention  scale  have  been  administered  to  the  same 
sample  of  respondents  with  reference  to  the  same  products  or  stimuli.  Such  a 
study  may  be  viewed  as  a  test-retest  situation  with  two  different  measures  of  the 
same  trait,  purchasing  intentions„  If  the  beta  binomial  model  holds  separately 
for  each  intention  scale  and  if  respondents'  true  intentions  are  constant 
across  administrations  of  the  two  scales  and  thus  are  beta  distributed  with  the 
same  parameters  (a  and  3),  then  the  distributions  of  the  two  sets  of  observed  ratings 
will  be  beta  binomial  but  differ  due  to  differences  in  the  number  of  scale  points 
used  in  each  scale  (5  vs.  11).  It  is  readily  shown  that  the  product-moment  correla- 
tion between  pairs  of  such  intention  ratings  (X^  ,X2)  scales  is  given  by: 

1/2 
r(X. 


/__V2___J\  ' 

''^*h^  ~    l(n^+a+e)(n2+a+6)) 


Thus  the  product-moment  correlation  between  intentions  ratings  obtained  on,  say, 
five  and  eleven  point  scales  from  the  same  population  of  respondentsr  with  reference 
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to  the  same  product  or  stimuli  is  equal  to  the  geometric  mean  of  the  reliabilities 

*     *  2 

of  the  individual  scales,  R.  and  R^.  Assuming  r(XiX2)  >  0,  then  r  (X^X^)  is  a 

* 
lower  bound  for  the  two  reliabilities  since  0  :£  K  <  1: 

R^^  >  r'^(X^X2)  -  '^2  • 

Juster  (1966,  p.  677)  presents  a  cross-tabulation  of  responses  to  a  six  month 
buying  intention  question  involving  five  response  categories  asked  with  reference 
to  automobiles  against  responses  to  his  eleven  category  purchase  probability 
scale  obtained  "a few  days"  later  from  the  same  sample  of  447  consumers.  The 
product-moment  correlation  between  these  two  scales  was  found  by  the  present 
authors  to  be  0.576  which  would  imply  a  lower  bound  of  0.332  on  the  reliabilities 
of  the  eleven  point  and  five  point  intentions  measures  used  by  Juster  under  the 
assumptions  of  the  beta  binomial  model. 

Another  piece  of  evidence  indicating  that  the  five  point  intention  scale 
and  the  eleven  point  Juster  probability  scale  are  highly  intercorrelated  is 
found  in  Haley  and  Case  (1979J  factor  analysis  of  thirteen  widely  used  rating 
scales  administered  among  a  sample  of  630  women  with  reference  to  the  six 
leading  brands  in  six  categories  of  packaged  goods.  The  rotated  loadings  on  the 
main  "evaluative"  factor  were  very   high  for  both  intention  scales:  .88  for  the 
five  point  scale  and  .86  for  the  eleven  point  Juster  scale.  The  between  scale 
correlation  indicated  by  these  factor  loadings  is  .76  and  implies  a  lower  bound 
of  .57  for  their  respective  reliabilities. 

Harry  Davis  generously  made  available  to  us  data  from  an  unpublished  study 
he  conducted  of  the  test-retest  reliablility  of  a  seven-point  intention  scale 
administered  in  a  concept  test  setting.  The  stimuli  rated  were  descriptions  of 
20  diverse  new  products  and  services  that  spanned  purchases  involving  major 
as  well  as  small  financial  outlays.  The  same  convenience  sample  of  53  adult 
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male  business  executives  rated  each  of  the  20  concepts  on  two  occasions  seper- 
ated  by  a  two  week  interval.  Table  1  lists  the  concepts  and  the  computed  values 
of  their  respective  test-retest  correlation  coefficients. 


INSERT  TABLE  1  HERE 


As  discussed  in  Silk  (.1977),  a  necessary  condition  for  a  test-retest  correla- 
tion coefficient  to  serve  as  an  interpretable  reliability  coefficient  in  the 
sense  of  (6)  is  that  the  variances  of  the  observed  test  and  retest  scores  be  equal. 
This  condition  appeared  to  hold  here  as  indicated  by  the  results  of  separate 
tests  for  the  equality  of  two  correlated  variances  carried  out  for  each  of  the 
concepts.  For  only  one  of  the  20  concepts  could  the  null  hypothesis  of  homo- 
geneous test  and  retest  variances  be  rejected  at  even  the  ,-20  level  (2  tail  test). 
However,  this  overall  conclusion  must  be  qualified  inasmuch  as  beta  binomial  distributed 
intention  scores  do  not  satisfy  the  distributional  assur^ption  (bivariate  normality) 
underlying  this  test  ^Snedecor  and  Cochran  1967,  pp.  195-197). 

Accepting  that  the  condition  of  stable  test  and  retest  variances  was  not 
seriously  violated,  we  may  compare  the  observed  levels  of  reliability  indicated 
by  the  test-retest  correlations  (r-j^)  with  those  expected  for  beta  binomial 
distributed  intention  scales  as  given  by  (ll)  above.  For  a  seven  point  intention 

scale  (n  =  6)  we  see  from  (11)  that  the  expected  reliability  (R  )  will  vary 

*     6 
inversely  with  the  magnitude  of  a  +  B"  according  to  R  =    r-   .  A  plot  of  this 

relationship  appears  as  the  solid  line  in  Figure  3  for  values  of  a  +  B  ranging 
0  to  7.  The  distributions  of  intention  ratings  from  the  first  test  were  used  to  esti- 
mate the  beta  binomial  parameters  a  and  B  for  each  concept  by  means  of  the  method  of 
maximum  likelihood  (Kalwani  (1980).  These  estimates  are  shown  in  Table  1  along  with 
the  expected  values  of  the  reliability  index  computed  b/   substituting  the  estimates, 
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a  +  g,  in  (11)  for  n=6.  As  indicated  by  the  values  of  the  chi  square  statis- 
tics reported  in  Table  1,  the  beta  binomial  model  generally  provided  a  good 
approximation  to  the  distributions  of  the  intention  ratings  for  these  concepts 
obtained  from  the  initial  test  occasion.  Comparing  the  computed  and  critical 
values  of  the  chi  square  statistics  shown  in  Table  1,  we  find  that  the  null 
hypothesis  that  the  intention  ratings  are  distributed  binomial  could  be 
rejected  at  the  .05  level  for  6  of  the  20  concepts  and  at  the  .01  level  for 
only  3  of  the  20  concepts.  In  Figure  3,  the  20  pairs  of  points  corresponding 
to  the  test-retest  correlation  and  the  sum  of  the  estimates  of  the  two  beta 
binomial  parameters  for  each  concept  are  also  plotted. 


INSERT  FIGURE  3  HERE 


The  concepts  are  listed  in  Table  in  descending  order  according  to  the 
magnitude  of  the  sum,  a  +  6.  In  light  of  the  great  diversity  of  the  concepts, 
the  considerable  variation  on  the  estimated  values  of  a  and  g  is  reassuring  in 
that  it  provides  a  preliminary  indication  that  the  respondents  were  at  least 
somewhat  discriminating  in  rating  the  different  concepts.  For  half  the  concepts, 
the  intentions  ratings  tend  to  be  skewed  toward  the  high  end  of  the  scale  while 
in  the  other  half  they  tend  to  be  skewed  in  the  opposite  direction.  However,  in 
only  about  a  third  of  the  cases  is  the  skewness  very  marked  in  either  direction. 

Except  for  the  air  conditioner  concept,  in  all  cases  the  sum,  a  +  3,  exceeds 
unity  --  the  range  of  values  of  the  quantity  being  from  .920  to  6.494,  Accordingly, 
for  these  data  the  reliability  coefficient,  R*  =  ^^^^  ^  would  be  expected  to  vary 
from  .867  to  .480  and  in  Table  1  we  find  the  observed  values  of  the  test-retest 
correlations  encompass  this  range,  varying  from  .840  to  .506.  More  critically, 
we  see  in  Figure  3  that  the  test-retest  correlations  tend  to  decrease  as  a  +  B 
increases  in  a  manner  that  appears  to  conform  generally  with  thfe  expected  form  of 
the  relationship  indicated  by  R*  =  ^+1+^   •  The  median  of  the  absolute  differences 
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between  the  value  of  the  expected  reliability  coefficient  (R*)  and  the  test-retest 
correlation  {r-.^)   for  the  20  concepts  is  .077  (on  the  0-1  scale),  with  the  value 
of  r,^  exceeding  that  of  R*  for  9  of  the  20  concepts.  In  light  of  the  fact  that 
the  same  respondents  rated  all  20  concepts,  it  is  noteworthy  that  the  scatter  of 
the  observed  test-retest  correlations  around  the  expected  reliability  curve  shown 
in  Figure  3  does  not  appear  to  follow  any  systematic  pattern. 

To  sum  up,  then,  these  results  indicate  that  the  reliability  of  a  seven  point 
intention  scale  behaves  in  a  manner  that  is  generally  consistent  with  Morrison's 
assumption  of  an  underlying  measurement  model  consisting  of  a  beta  distributed  true 
score  and  a  binomial  distributed  error  component.  Clearly  the  scope  of  the  evidence 
presently  available  to  assess  this  assumption  is  limited  and  further  studies  of 
reliability  are  needed,  especially  for  scales  with  different  numbers  of  scale 
points  and  for  applications  of  them  to  both  branded  packaged  and  durable  goods. 

Bias  and  Discriminant  Ability 

The  literature  on  intention  scales  shows  much  more  concern  with  bias  or 
systematic  error  than  with  random  me?isurement  error  and  reliability.  Evidence 
relating  to  systematic  errors  in  intention  scales  is  available  from  two  quite 
different  bodies  of  research.   One  group  of  studies  has  investigated  the  biases 
in  intention  measures  which  appear  when  they  are  related  to  purchase  data. 
In  the  other  set  of  studies  the  frequent  failure  of  respondents  to  discriminate 
among  alternatives  in  their  intentions  ratings  for  a  set  of  stimuli  has  been 
attributed  to  the  presence  of  a  persistent  and  stable  component  of  systematic 
error. 
1.  Intentions  and  Purchflsos 

In  the  area  of  purchases  of  generic  types  of  household  durables,  considerable 
evidence  exists  documenting  the  systematic  error  that  appears  when  responses  to 
intention  questions  or  the  Juster  probability  scale  obtained  at  one  point  in  time 
are  compared  cross-sectionally  to  reports  of  subsequent  ourchase  events  for  the 
same  sample  of  households.  An  excellent  review  and  discussion  on  this  work 
appears  in  McNeil  (1974,  1975),  Adams  (1974),  and  Juster  (1974)  and  need  not  be 
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repeated  here  except  to  note  a  persistent  findiny  in  these  "fu I t i I linent"  studies 
has  been  that  households  reporting  no  Intention  or  a  zero  purchase  probability 
were  later  observed  to  account  'or  a  very  substantial  share,  frequently  the  major- 
ity, of  subsequent  purchases  (M:Neil  1974,  p.  8  and  Juster  1974,  pp.  ^Z   and  14]. 
In  the  context  of  Morrison's  model,  such  systematic  ^fscreoancies  could  be 
interpreted  as  indications  of  instability  (p)  or  bias  (b),  or  some  combination 
of  the  two. 

Efforts  to  link  intentions  to  purchase  data  for  frequently  purchased  branded 
goods  have  been  rare  and  generally  too  limited  in  scope  to  produce  much  insight 
into  the  sources  of  sytematic  errors.  An  exception  is  the  work  of  Bird  and 
threnberg  (1966)  who  conducted  a  detailed  analysis  of  the  relationship  between 
intentions  and  usage  using  an  e> tensive  body  of  survey  data  collected  in  the 
United  Kingdom  covering  more  th^in  100  brands  from  20  product  dategories.  The 
intentions  variable  investigated  was  the  percentage  of  respondents  who  selected 
a  particular  brand  from  a  designated  list  when  asked  a  straightforward  question 
about  which  brand  they  were  "likely  to  buy  in  the  future".  Current  usage 
was  similarly  defined  as  the  percentage  of  respondents  who  claimed  to  have 
used  the  brand  within  some  specified  time  period,  fhe  bulk  of  their  analysis 
focused  on  the  concurrent  relationship  between  intention  and  usage  levels  across 
brands  within  the  same  product  category. 

The  finding  reported  by  Bird  and  Ehrenberg  (1966,  p.  32)  of  particular 
here  was  that  for  brands  with  stationary  usage  levels,  the  percentage  of  respon- 
dnets  expressing  an  intention  to  buy  exceeded  the  percentage  claiming  to  be  users- 
"for  all  brands  and  product  fields,  irrespective  of  usage  definitions." 

Bird  and  Ehrenberg  went  on  to  show  that  the  likelihood  of  a  respondent 
expressing  an  intention  to  buy  a  brand  was  strongly  associated  with  the  recency 
of  the  respondent's  past  usage  of  that  brand.  As  a  "^ypical  example",  they 
presented  data  which  showed  that  the  proportion  of  respondents  intending  to 
buy  a  brand  was  .9b  for  current  users  of  the  brand,  .45  for  those  who  were  not 
current   users  but  had  used  the  brand  within  the  past  6  months,  .10  for  those 
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who  had  last  used  the  brand  more  than  6  .lonths  ago,  and  .05  for  those  who  had  never 
used  it.  Bird  and  Ehrenberg  note  that  purchase  freauency  will  tend  to  be 
reflected  in  the  reported  recency  of  last  use--i.e.,  recency  of  use  will  tend  to 
be  correlated  with  frequency  of  purchase.  Thus  the  decline  in  the  oroportion 
expressing  intentions  as  the  time  since  last  use  increases  implies  some  correlation 
between  expressed  intentions  and  purchase  frequency.  But  it  is  clear  from  the 
above  example  that  when  the  overall  usage  level  is  stationary  over  time,  expressed 
intentions  will  overstate  the  actual  frequency  of  purchase. 

The  fact  that  the  likelihood  of  expressing  an  intention  to  purchase  is 
associated  with  recency  of  past  use  suggests  that  this  may  be  an  example  of  how 
biases  arise  in  judgements  about  uncertain  events  due  to  respondents'  reliance 
on  what  Tversky  and  Kahneman  (1974,  p.  1130}  call  the  "availability  heuristic"— 
"the  tendency  to  assess  the  relative  frequency  of  an  event  by  the  ease  with 
which  iinstances  or  occurrences  can  be  brought  to  mind."  Recent  purchases 
are  more  likely  to  be  recalled  than  previous  ones  and  respondents  may  under- 
estimate their  interpurchase  times  and  hence  overestimate  their  frequency  of 
purchasing.  There  is  considerable  evidence  from  other  studies  of  the  errors 
respondents  make  in  reporting  past  events  (as  opposed  to  intentions)  that  for 
frequently  occurring  events,  there  is  a  tendency  for  the  total  number  of  events 
to  he  overreported  ( Sudman  and  Bradhurn  1974,  Chapter  3).  This  phenomenon  is 
a  result  of  "telescoping  error"  whereby  an  event  is  remembered  as  having  taken 
place  more  recently  than  it  actually  occurred.  Faced  with  the  task  of  answering 
an  intention  question  asked  with  reference  to  a  particular  brand  of  some  frequently 
purchased  product,  a  respondent  may  seek  to  make  a  judgement  about  the  likelihood 
of  purchasing  the  brand  in  the  future  based  on  his/her  recall  of  past  purchases. 
If  such  occurs,  then  we  would  expect  that  the  functioning  of  the  normal  memory  process 
would  involve  telescoping  and  thereby  produce  a  response  which  overstates  his/her 
true  purchase  frequency. 
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It  is  more  difficult  to  conjecture  about  the  judgemental  heuristics  underlying 
responses  to  intention  questions  for  durable  goods.  The  major  known  bias  noted 
above,  the  tendency  for  those  with  no  intention  or  a  zero-purchase  probability 
to  make  a  substantial  number  of  purchases,  suggests  that  the  bias  is  of  the 
opposite  sign  and  fundamentally  different  from  that  which  appears  to  persist  for 
packaged  goods.  The  extended  "post-mortem"  reviews  by  McNeil  0974)  and  Ouster 
0974)  of  the  Bureau  of  Census'  experience  with  the  Survey  of  Consumer  Buying 
Expectations  give  the  impression  that  reasons  for  this  well -documented  downward 
bias  remain  something  of  a  mystery. 

Dirkson  and  Wilkie  C1978,  p.  86)  take  a  different  view.  They  contend  that 
"A  survey  of  the  literature,.. reveals  several  sensible  explanations  for  the 
high  incidence  of  'unplanned'  purchases  and  unfulfilled  planned  purchases"  and 
go  on  to  argue  that  "few  durable  purchases  can  be  viewed  as  unplanned  when 
contingencies,  priorities,  and  situational  constraints  are  considered." 

In  terms  of  the  parameters  of  Morrison's  model,  it  could  be  that  the 
respondents  with  initially  low  or  non-existent  intentions  are  the  group  most  likely 
to  experience  a  positive  change  in  true  intention  as  a  result  of  unforeseen  events 
such  as  the  unexpected  breakdown  of  durables.  Failure  to  consider  contingencies 
under  which  one  would  replace  an  existing  durable  would  lead  to  an  underestimate 
of  a  purchase  probability  and  could  be  an  example  of  an  "imaginability"  bias 
(Tversky  and  Kahneman  1974,  p.  1127).  Meter  (1970,  pp.  16-17)  reviews  some  limited 
evidence  which  suggests  that  recall  of  the  purchase  dates  of  a  major  appliance 
is  subject  to  telescoping  error.  If  so,  respondents  would  tend  to  understimate 
the  age  of  an  appliance  and  f-e  probability  of  having  to  replace  it.  Clearly  much 
remains  to  be  learned  here  and  a  better  understanding  of  the  judgement 
heuristics  underlying  response  biases  could  be  useful  in  improving  the  design 
of  measuring  instruments. 


■20- 


2.  Response  Style 

It  has  frequently  been  observed  that  in  providing  intention  ratings  for 
packaged  goods  stimuli,  substantial  numbers  of  respondents  give  identical  or 
yery   similar  response  to  the  entire  set  of  alternatives  they  are  asked  to  rate. 
This  tendency  has  generally  been  interpreted  as  a  manifestation  of  some  type 
of  response  style  bias  (or tendency  to  select  a  particular  scale  response 
category  a  disproportionate  amount  of  the  time,  independent  of  the  scale  content 
or  referent)  suchas  acquiesence  or  yeasaying/naysaying  (Wells  1961). 

The  problem  is  particularly  troubling  when  intention  scales  are  used  as 
the  criterion  measure  in  concept  tests.  Several  studies  have  found  that  approx- 
imately 30  per  cent  of  respondents  are  "non- discriminators"  with  respect  to 
their  intention  ratings  of  alternative  stimuli  (Clancy  and  Garsen  1970,  Gold 
and  Salkind  1974,  Ptacek  and  Ross  1979).  Research  on  psychological  testing  has 
suggested  that  response  style  tendencies  are  most  likely  to  be  operative  when 
subjects  are  ambiguous  or  uncertain  about  how  to  respond  (cronbach  1950,  Jackson 
1967).  Hence, intentions  ratings  obtained  in  concept  tests  may  be  especially 
prone  to  response  style  bias  when  respondents  have  only  a  limited  exposure  to 
the  concept  stimuli  and/or  when  the  concepts  are  highly  similar. 

The  amount  and  source  of  variability  of  intentions  ratings  may  be  illustrated 
here  by  examining  the  results  of  some  further  analysis  of  the  data  from  Davis' 
study  described  above  in  the  discussion  of  reliability.  In  particular,  we  carried 
out  an  analysis  of  variance  using  the  following  mixed  model  for  repeated  measures: 


where: 


^ct  -^--,^^c'\'   ^^^)ic  -^  ^^T)it  "  ^C'^k  -^  S-ct 


X.  .  =  respondent  i's  rating  of  concept  c  on  occasion  t 
^  on  the  seven-point  intention  scale. 


grand  mean, 
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a.  =  effect  of  respondent  i,  i  =  1,2,..., 63, 

C^  =  effect  of  concept  c,  c  =  1,2 20, 

T.  =  effect  of  occasion  t,  t  =  1,2, 

(aC).  =  effect  of  interaction  of  respondent  i  with  concept  c, 

(aT)..  =  effect  of  interaction  of  respondent  i  with  occasion  t, 

(CT)  ^  =  effect  of  interaction  of  concept  c  with  occasion  t. 


The  effects  of  concepts  and  occasions  are  taken  as  fixed  while  respondents  are 
assumed  to  be  random.  The  ANOVA  summary  is  presented  in  Table  2.  Note  our  use 
of  the  ANOVA  results  is  confined  to  obtaining  estimates  of  the  contributions  made 
by  the  different  effects  posited  in  the  above  model  to  the  total  variance  of  the 
observed  intention  ratings.  The  assumptions  of  normality  and  homoscedasticity 
of  the  error  component  required  for  F  tests  of  hypothesis  to  be  strictly  valid 
and  for  estimates  of  the  sampling  errors  of  parameters  to  have  optimum  statistical 
properties  will  not  in  general  hold  for  beta  binomial  distrubuted  intentions 
ratings  and  are  not  satisfied  for  the  particular  set  of  data  under  consideration 
here.  We  are  interested  in  the  above  ANOVA  model  as  a  means  of  summarizing  the 
sample  data  rather  than  as  a  basis  for  making  statistical  inferences  and 
tests.  The  variance  components  shown  in  Table  2  were  obtained  by  setting  the 
mean  squares  for  the  effects  equal  to  their  corresponding  expected  values  and 
solving  the  resulting  equations  (Cronbach  et  al .  I97ii,  Chapt.  2j. 


INSERT  TABLE  2  HERE 


We  interpret  the  variance  associated  with  the  main  effect  of  respondents 
as  being  largely  attributable  to  individual  differences  in  response  styles. 
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although  any  differences  among  respondents  with  respect  to  some  general 
"innovativeness"  trait  would  also  he  reflected  in  this  component.  In  any  case, 
it  is  noteworthy  in  Table  2  that  the  variance  component  attributable  to  respondent 
differences  across  concepts  is  one  and  a  half  times  greater  than  that  due  to 
mean  differences  among  this  highly  diverse  set  of  concepts.  In  a- normal  concept 
test,  conducted  on  a  single  product  category,  one  would  expect  that  the  set  of 
stimuli  investigated  would  be  more  homogeneous  than  the  broad  range  of  alterna- 
tives listed  in  Table  1  and  hence  an  even  smaller  amount  of  variation  due  to 
between  concept  mean  differences  would  likely  be  observed. 

The  largest  variance  component  in  Table  2  is  that  which  is  attributed  to  the 
interactions  between  respondents  and  concepts  and  accounts  for  almost  half  (46  per 
cent)  of  the  total  variance  in  the  observed  intention  ratings.  This  indicates  a 
substantial  amount  of  idiosyncratic  discrimination  among  concepts  by  respondents. 
It  is  interesting  to  note  that  the  index  Haley  and  Case  (1979)  used  to  compare 
the  discriminant  ability  of  several  widely  used  rating  scales  is  one  which 
essentially  defines  discrimination  solely  in  terms  of  the  magnitude  of  the  main 
effects  of  between  stimuli  mean  differences.  The  type  of  design  they  employed 
does  not  allow  any  effects  due  to  the  interaction  of  respondents  with  stimuli 
to  be  separated  from  the  residual  variance .  As  suggested  by  Gold  and  Salkind 
(1974,0.  22),  more  information  about  the  appeal  of  alternatives  to  sub-segments 
of  respondents  might  be  obtained  from  concept  or  brand  discrimination  tests 
employing  intention  scales  if  each  respondent  rated  each  stimulus  twice 
rather  than  once  which  appears  to  be  the  usual  practice. 

Are  More  Scales  Points  Better? 

The  expression  (11)  for  the  reliability  of  beta  binomial  intention  scales 
presented  earlier  in  this  paper  implies  that  the  reliability  of  such  a  scale  can 
be  increased  by  expanding  the  number  of  scale  points  it  contains.  Unfortunately, 
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no  relevant  evidence  bearing  directly  on  this  issue  appears  to  be  available 
presently.  In  his  extensive  recent  review  of  the  literature  dealing  with  the 
"optimal"  number  of  response  alternatives  for  a  scale.  Cox  U980,  p.  48)  concludes 
that  "the  empirical  studies  provide  additional  ,  consistent  evidence  suggesting 
that  increasing  the  number  of  alternatives  may  increase  the  reliability  of  a  scale 
but  that  the  potential  is  probably  minor  in  comparison  with  other  means."  However, 
the  bulk  of  the  evidence  Cox  discusses  pertains  to  composite  rather  than  single 
item  scales  such  as  intention  measures. 

Cox  draws  attention  to  another  issue  raised  some  years  ago  by  Cronbach  (1950) 
who  maintained  that  the  reduction  in  random  measurement  error  obtained  by  expanding 
the  number  of  response  alternatives  would  be  accompanied  by  a  large  component  of  system- 
atic error  due  to  response  style  bias.  Cronbach  (1950,  p.  22)  suggested  that 
"the  argument  that  the  finer  scale  gives  more  reliability  is  not  a  sound  one, 
since  this  is  what  we  would  expect  if  all  of  the  added  reliable  variance  were 
response-set  vartance  and  had  no  relation  to  the  beliefs  about  the  attitude 
object  in  question."  If  the  advantage  with  respect  to  reliability  of  scales  with 
more  response  alternatives  is  offset  by  an  increase  in  response  style  bias,  then 
their  discriminant  and  predictive  ability  would  also  be  adversely  affected. 

Some  support  for  Cronbach' s  (1950)  warning  that  scales  with  more  response 
alternatives  may  be  more  afflicted  by  response  style  bias  than  scales  with 
fewer  levels  is  found  in  Haley  &  Case's  (1979)  study.  Their  results  (p.  29) 
indicate  that  a  five  point  intention  scale  discriminated  somewhat  better  among 
brands  than  did  the  version  of  the  eleven  point  purchase  probability  scale  they 
employed.  This  directional  result  appeared  in  four  of  the  six  product  categories 
investigated  but  in  two  of  the  aforementioned  four  categories  the  differences  were 
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slight.  Furthermore,  in  contrast  to  the  original  Juster  scale  and  Wells's  (1961) 
ten  point  "readiness  to  buy"  scale,  Haley  and  Case's  purchase  probability  scale 
included  descriptive  labels  for  only  four  of  the  eleven  response  alternatives.  As 
they  suggest  (  p.  31)  this  feature  of  their  eleven  point  scale  may  have  dimi- 
nished its  discriminant  ability. 

Thus  the  relative  merits  of  longer  versus  shorter  intention  scale  remain 
an  unresolved  question.  A  well-designed  study  that  investigates  thte  interactions 
and  tradeoffs  between  reliability  and  discriminant  ability  for  scales  of  different 
lengths  is  needed  to  untangle  the  issues. 

In  general  then,  the  evidence  available  from  investigations  of  the  discri- 
minant quality  of  intentions  ratings  and  from  direct  comparisons  of  intention 
and  purchasing  levels  both  indicate  the  presence  of  a  sizable  component  of  syste- 
matic error.  While  the  sources  of  such  biases  are  not  well  understood,  the  task 
with  which  respondents  are  confronteu  in  reporting  their  intentions  for  a  generic 
type  of  durable  good  would  appear  to  be  quite  different  than  when  asked  to 
express  their  intentions  with  respect  to  a  specific  brand  of  some  packaged 
good  if  only  because  of  inherent  differences  in  the  length  and  regularity  of 
interpurchase  times  for  these  two  types  of  purchases.  Thus  it  is  of  consider- 
able interest  to  explore  the  application  of  Morrison's  model  over  a  broad  range 
of  purchase  decisions.  In  the  next  section  we  present  some  further  results 
obtained  by  fitting  the  model  to  data  for  both  durable  and  packaged  goods. 

APPLICATIONS  AND  TESTS  OF  THE  INTENTION-PURCHASE  MODEL 
Data 

The  essential  data  needed  to  estimate  the  parameters  of  Morrison's  model 
are  the  marginal  distribution  of  purchase  intentions  measured  at  one  point  in 
time  and  information  about  the  incidence  of  subsequent  purchasing  by  the  same 
respondents  tabulated  according  to  the  level  of  their  previously  expressed 
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intentions.  A  careful  search  of  the  published  marketing  research  literature 
uncovered  a  few  data  matrices  of  the  aforementioned  type  which  were  reported  in 
sufficient  detail  to  permit  the  re-analysis  required  to  estimate  and  test  Morri- 
son's model.  Table  3  lists  the  sources  of  the  data  used  here  and  summarizes  the 
pertinent  features  of  each  data  set  and  the  manner  in  which  it  was  collected. 


INSERT  TABLE  3  HERE 


Estimation  and  Testing 

The  frequency  distribution  of  intention  scale  responses  is  used  to  obtain 
maximum  likelihood  estimates  of  the  beta  binomial  model  parameters.  Observed 
purchase  frequencies  for  respondents  with  various  stated  intention  values  are 
used  for  estimating  the  parameters  of  the  linear  model  relating  purchase  proba- 
bility and  stated  intention.  Finally,  a  likelihood  ratio  test  is  used  to  compare 
the  goodness-of-fit  of  alternative  models.  A  description  of  each  of  these  estima- 
tion and  testing  procedures  follows. 
1.  Estimation  of  the  Beta  Binomial  Model 

If  the  random  variable  x  (=0,1,..., n)  is  distributed  beta-binomial  with 


parameters  a  and  g,  then  the  probability  of  observing  I  =  x/n  is: 


Pii   )   =  fn\  B(a+x,B+n-x) 


U7) 


where  B(a.6)  =  r(a)r(.6)/r^a+B)  and  r^°)  denotes  the  gamma  function.  Let  N  repre- 
sent the  number  of  households  with  a  response  of  x  on  a  stated  intention  scale 
ranging  from  0  to  n.  Then,  the  likelihood  function  for  ovserving  frequencies 
N  's  is  given  by: 

UN,'s;a,6)  =  j^  P(I^)  ^  ^  ^18J 
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Constant  terms  ignored,  the  logarithm  of  the  likelihood  function  simplifies  to: 

n        rx-1  n-x-1  n-1  ^ 

L(N   's-.a.B)   =     Z  N  <     s       log(a+r}  +       E         log(e+r)   -     I       log(a+6+r)  >         (19) 
^  x=0  '^  Lr=0  r=0  r=0  J 

The  maximum  likelihood  estimates  of  a  and  6  can  be  obtained  by  maximizing  the 

log-likelihood  function  in  equation  (19)  through  numerical  optimization 

(Kalwani  1980).  Standard  errors  of  the  maximum  likelihood  estimators  are 

obtained  from  the  inverse  of  the  information  matrix  using  the  Cramer-Rao  lower 

bound  for  the  asymptotic  variance  of  estimators  (Rao  1965,  pp.  289-302). 

2.  Estimation  of  the  Linear  Model 

As  noted  above,  Morrison  (1979)  proposed  a  linear  model  for  the  relationship 

between  purchase  probability  and  stated  intention.  That  is: 

P^  =  A  +  BI^  ,  (20) 

where  P^  is  the  purchase  probability  of  a  respondent  with  stated  intention 
I^.  Equation  (18)  indicates  that  the  intercept  and  slope  terms  are  given  by: 

A'^F-irS^-''   .  (21) 

where  p  is  the  probability  that  there  is  a  change  in  an  individual's  true  intention 
and  b  is  the  systematic  bias. 

Let  N^  represent  the  number  of  respondents  with  purchase  intention  I  who  buy 
the  product  and  m  =  N  -n  the  number  who  do  not  buy  the  product.  Then,  the 
likelihood  function  for  observing  the  purchase  frequencies  n  's  and  m  's  is  given  by: 
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"     "x     \ 
£(n^'s,m^'s;A,B)  =  n  P^  [^-P^)     ,  (23) 

x=0 
where  P  =  A+BI  .  Taking  logarithms,  we  obtain  the  log-likelihood  function: 

"  (  ) 

We  find  the  maximum  likelihood  estimates  of  A  and  B  by  maximizing  the 
log-likelihood  function  in  equation  (24)  through  numerical  optimization. 
Standard  errors  of  maximum  likelihood  estimators  are  obtained,  as  before, 
from  the  inverse  of  the  information  matrix  using  the  Cramer-Rao  lower  bound 
for  the  asymptotic  variance  of  estimates  (Rao  1965,  pp.  289-302). 

Equation  (22)  is  used  to  obtain  the  maximum  likelihood  estimate  of  p 
from  the  maximum  likelihood  estimates  of  a,  B  and  B,  that  is: 

p  =  1  .  B  (^^^""^  .  (25) 

If  b  is  estimated  from  equation  (16),  then  our  model  suggests  that  the 
estimated  value  of  A  should  be  given  by  equation  (21). 
Tests  of  Alternate  Models 

We  use  the  likelihood  ratio  test  to  compare  the  goodness-of-fit  of 
alternate  models.  First,  we  compare  the  linear  model: 

Px  =  A  +  BI^  ,  (26) 

with  the  null  hypothesis  P  =  A  with  B  set  equal  to  zero.  For  this  purpose, 
we  form  the  ratio  of  the  maximum  value  of  the  likelihood  function  of  the 
unconstrained  model  under  the  null  hypothesis  (say,  model  0)  to  that  for 
the  unconstrained  model  (say,  model  1).  That  is: 

^01 


A..  =A^ 


a   U) 


where  the  asterisk  denotes  that  it  is  the  maximum  value.  For  large  samples, 
Wilks  (1962,  Chapter  13)  has  shown  that  -21oggA  is  chi-square  distributed 
with  1  degree  of  freedom  representing  the  difference  in  the  number  of  the 
parameters  of  the  unconstrained  and  constrained  models. 

We  also  compare  the  linear  model  with  the  piecewise  linear  model.  The 
mathematical  form  of  the  latter  model  is: 

P^  =  A  .  B,I^  ,  it   I,  :  i;  .  (27A) 

'.  -  I'  *  '/.  *  ^2^'k-O    •  ^f'x>■^  <^'8' 

Figure  4  provides  a  graph  of  the  piecewise  linear  model.  The  linear  model 
given  in  equation  (26)  is  a  constrained  version  of  the  piecewise  linear  model 
in  equation  (27).  The  chi-square  statistic  from  the  likelihood  ratio  test 
in  this  case  has  2(=4-2)  degrees  of  freedom. 


INSERT  FIGURE  4  HERE 
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RESULTS 

In  this  section,  we  present  empirical  results  from  analyses  of  stated 
intentions  and  purchase  data  for  durable  and  non-durable  goods.  For  each 
product  we  report  parameter  estimates  and  their  standard  errors  for  the  beta 
binomial  model  describing  purchase  intentions  (i.e.,  a,  g,  SE(a)  and  SE(B))  and  the 
linear  model  linking  stated  intentions  to  purchase  probabilities  (i.e., 
A,  B,  SE(AJ,  and  SE(B)).  As  indicated  previously,  the  maximum  likelihood 
estimate  of  the  parameter  p  representing  the  probability  of  change  in  a 
given  individual's  true  intention  is  obtained  from  a,  B  and  B  using  equation 
(22). 

We  employ  four  different  performance  measures  to  evaluate  the  models- 
linear  and  piecewise--linking  stated  intentions  to  purchase  probabilities. 
The  first  measure  is  a  chi-square  statistic  obtained  from  the  likelihood 
ratio  test  as  described  in  the  previous  section.    Non-signifi- . 
cant  values  of  the  statistic  indicate  that  the  null  hypothesis  cannot  be 
rejected  in  favor  of  the  alternate  hypothesis.  We  perform  two  likelihood 
ratio  tests  for  each  durable  and  packaged  good  data  set.  In  one  case  we 
treat  the  linear  model  as  the  alternate  hypothesis  with  P^  =  A  as  the  null 
model  and  in  the  other  case  we  posit  the  piecewise  linear  model  as  the 

alternate  hypothesis  with  the  linear  model  as  the  null  hypothesis. 

2 
The  second  performance  measure  is  denoted  by  R  and  is  defined  as  in 

regression  analysis  as  the  ratio  of  "explained  variance"  to  "total  variance". 

It  is  given  by: 


9      x-0  ^  ^    ^ 
R^  =  1  -  "n       :~7  ' 

x=0  ^  ^    ^ 

where  N^  =  the  number  of  respondents  with  stated  intention  I^  =  x/n, 

n  =  the  number  of  respondents  with  stated  intention  who  purchase  the 

product. 


(28) 


P  =  (  ^  nJ/(  E  N J, 
^    x=0  ^   x=0  ^ 

P  =  the  predicted  purchase  probability  given  that  the  stated  intention 

The  numerator  of  the  second  term  on  the  right-hand  side  of  equation  (28)  is,  of 

course,  the  unexplained  variance  and  the  denominator  is  the  total  variance.  As  in 

2 
regression,  R  is  a  measure  of  the  goodness-of-fit  of  the  hypothesized  model ~ 

linear  or  piecewise--to  the  observed  purchase  data. 

We  also  report  the  standard  error  of  estimate,  S,  in  each  case.  Recall 

2 
that  S  is  defined  as  : 

n        .  ^ 

s2  =  XfO ^ ^,  (29) 

N  -  k  -  1 


where  N  =  z  N  ,  k  is  the  number  of  model  oarameters,  and  the  other  terms 
x=0  ^ 

are  defined  as  in  equation  (28J.  The  standard  error  of  estimate  provides  a 
measure  of  the  error  associated  with  predicting  individual  purchase  probabilities 

and  may  be  used  to  construct  the  confidence  interval  around  the  predicted  values. 

2 
The  final  performance  measure,  we  report,  is  denoted  by  Rj^  and  is  defined 


as  (see  McFadden  1970,  p.  121) 
,2  =  1  _  lIiai 


R^  =  1  -  V^  ,  (30) 


L  (N) 


where  L  (A)  and  L  (N)  represent  the  maximum  values  of  the  log- likelihood  functions 

2 
for  the  alternate  and  null  models  respectively.  The  measure  Rj^  takes  the  value 

one  when  L  (A)  is  zero  which  happens  when  the  likelihood  function  is  a  product 

2 
of  unit  probabilities.  In  other  words,  R^^  takes  the  value  one  when  with  the 

alternate  model  .  we  are  able  to  nredict  outcomes  (i.e.,  purchase  vs.  non-purchase) 
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with  probability  one.  If  the  purpose  of  models  linking  prior  intentions  to 

purchase  probabilities  is  to  compare  the  overall  sample  response  to  alternate 

7  2 

stimuli,  then  Rj^  is  a  less  appropriate  performance  measure  than  R  .  We  report 

R,^  here  for  completeness.  Below  we  present  the  results  from  data  on  durable 

and  branded  packaged  goods. 

Durable  Goods  Results 

In  the  case  of  durable  goods,  we  report  results  for  two  sets  of  data. 

The  first  data  set  is  from  Juster  (1966)  and  was  used  by  Morrison  (1979)  to 

illustrate  the  application  of  his  model.  The  source  of  the  second  data  set  is 

Pickering  and  Isherwood  (1974).  Table  4  displays  the  parameter  estimates  along 

with  the  four  performance  measures  for  the  Juster  data.  Estimates  of  the  beta 
binomial  model  parameters  a  and  g  were  obtained  from  the  stated  tntention  fre- 
quency data,  naiDely  the  N^'s. 


INSERT  TABLE  4  HERE 


Table  5  reports  the  actual  and  predicted  purchase  intention  frequencies 
(rounded  to  the  nearest  integer)  for  the  Juster  data.  Predicted  frequencies 
were  obtained  from  S  and  g.  Examination  of  the  results  in  Table  5  reveals  that 
the  beta  binomial  model  provides  a  very  good  fit  to  the  stated  intention  data 
in  all  four  cases.  In  three  of  the  four  cases,  the  x  statistics  are  less  than 
the  critical  value  of  26.125  (with  8  degrees  of  freedom  and  1  percent  signifi- 
cance level). 

INSERT  TABLE  5  HERE 


In  addition  to  the  estimates  of  a  and  3,  Table  4  also  displays  the  find- 
ings from  fitting  the  linear  model  to  the  Juster  data. 


-32- 


2 

Examination  of  the  x-]  values  reveals  that  they  are  all  greater  than 

2 
6.635  which  means  that  they  probability  of  obtaining  each  of  these  x-i  values 

is  less  than  0.01.  Therefore,  we  reject  the  null  hypothesis  that  predicted 

purchase  probabilities  are  independent  of  stated  intentions.  We  conclude  that 

the  linear  model,  Px  ^  '^  "*"  ^^x'  P'"°^''^^^  ^  superior  fit  to  the  data  than  the 


2 
The  values  of  R  provide  an  indication  of  the  goodness-of-fit  of  the 

linear  model.  Figures  5  through  8  provide  a  graphical  display  of  the  quality 

of  the  fit.  It  is  worthwhile  noting  that  for  automobiles,  12  months  intention 

data  (Figure  6)  with  R^  =  0.972  and  standard  error,  S  =  0.027  fit  six-month 

purchases  better  than  do  six-month  intentions  data  (Figure  5)  with  R  =  0.772 

and  standard  error,  S  =  0.069.  For  household  appliances,  six-month  intentions 

2 
data  were  not  obtained  but  twelve-month  intentions  data  (Figure  7)  with  R  = 

0.890  and  standard  error,  S  =  0.009  fit  six-month  purchase  data  better  than 

do  twenty-four  month  intentions  data  (Figure  8)  with  R  =  0.752  and  standard 

error,  S  =  0.019.  As  Morrison  (1979,  p.  72)  notes,  this  raises  the  question 

of  the  optimum  time  horizon  (say,  x  months)  for  collecting  intentions  data  given 

that  purchase  predictions  over  a  period  of  certain  length  (say,  y  months) 

are  desired.  The  analysis  of  the  stated  intentions  from  Juster  (1966)  reported 

in  Morrison  (1979)  clearly  indicates  that  respondents  are  sensitive  to  the 

time  horizon.  For  both  automobiles  and  household  appliances,  there  is  a  shift 

in  the  frequency  distribution  of  stated  intentions  from  a  lesser  towards  a 

greater  intent  to  purchase  the  product  (1^=0  to  1^=1)  as  the  time  horizon 

increases. 


INSERT  FIGURES  5-8  HERE 
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The  standard  errors  of  estimate  are  given  in  the  second  to  last  column  of 
Table  4.  As  an  illustration  of  their  implication,  consider  the  data  set, 
Autos  II.  The  predicted  purchase  probability  for  those  respondents  with 

P  =  A  +  BI  =  0.071  +  0.528(2/10)  =  0.177. 

Assuming  P  is  normally  distributed,  the  95%  confidence  interval  around  P^^ 
is  given  by  P^  +  2S  =  0.177  +  2(0.027),  or,  0.123  to  0.231. 

The  estimates  of  p  in  Table  4  are  obtained  from  equation  (25).  Note 
that  p's  for  automobiles  are  smaller  than  those  for  household  appliances 
indicating  that  respondents'  intentions  to  buy  automobiles  are  more  stable 
than  in  the  case  of  appliances. 

Finally,  Table  4  also  displays  numerical  values  of  bias  b  due  to  system- 
atic errors  for  automobile  and  appliance  data.  It  is  estimated  as  follows: 

b  =  _£_«_  + _ll=£i_^  _  A  . 
a  +  g   a  +  e  +  n 

Positive  values  of  b  indicate  that  the  mean  of  expressed  intentions  exceeds 

the  observed  proportion  of  purchases.  It  is  worth  noting  that  the 

maximum  likelihood  estimates  of  A  are  stable;  that  is,  the  associated 

standard  errors  of  estimate  yield  t-values  of  over  4.6.  This  finding  indicates 

that,  in  case  of  durable  goods,  the  assumption  of  constant  bias  across 

respondents  is  not  unreasonable  since  to  the  extent  that  systematic  erros  due 

to  response  set  tendencies  or  other  sources  vary  across  respondents  we  would 

observe  large  variation  in  A. 

Table  6  displays  the  results  for  the  Pickering  and  Isherwood  data. 

Estimates  of  the  beta  binomial  model  parameters  a  and  3  are  given  in  columns 

3  and  4.  For  space  reasons,  we  do  not  report  the  observed  and  predicted 

stated  intention  frequencies  for  the  Pickering  and  Isherwood  data.  None  of 

the  computed  x  values  exceeded  the  critical  value  of  26.125  (with  8  degrees 
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of  freedom  and  1  percent  significant  level)  and  the  quality  of  fit  was  com- 
parable to  that  of  the  Ouster  data. 

All  x-.  statistics  (except  that  for  black  and  white  television)  in  Table 

2 
6  are  significant  at  the  1  per  cent  level.  The  sole  exception  of  x-]  =  3.313 

in  the  case  of  black  and  white  television  is  significant  at  the  10  per  cent 

level.  Therefore,  we  reject  the  null  hypothesis  that  P  =  A.  Once  again,  we 

conclude  that  for  durable  goods  the  linear  model  provides  a  superior  fit  to  the 

2 
data  than  the  null  model,  P  =  A.  The  R  values  are  reasonably  high  with  the 

2  2 

median  R  =  0.535.  Numerical  values  of  standard  errors  of  estimate  and  Rj.  are 

comparable  to  those  of  the  Juster  data. 

As  in  the  case  of  the  Juster  data,  we  find  that  the  estimates  of  A  are 

stable  (i.e.,  the  t-values  are  statistically  significant).  Once  again,  this 

finding  indicates  that  the  assumption  of  constant  bias  is  tenable  in  the  case  of 

durable  goods. 


INSERT  TABLE  6  HERE 


A  major  finding  from  the  durable  goods  results  reported  in  Tables  4  and  5  is 
that  the  linear  model  provides  a  significantly  better  fit  than  the  null  model 
(P  =  A)  with  purchase  probabilities  being  independent  of  intentions.  A  natural 
question  that  arises  is  whether  there  are  other  models  of  a  relatively  simple 
form  which  would  provide  better  fits  to  the  data  than  does  the  linear  model.  One 
reasonable  alternative  non-linear  model  is  the  piecewise  linear  model.  Its 
mathematical  form  is  given  in  equation  (27)  and  a  graphical  representation  is 
shown  in  Figure  4.  As  before,  we  employ  the  likelihood  ratio  test  to  compare  the 
goodness-of-fit  of  the  linear  and  piecewise  linear  models. 

Results  of  this  comparison  for  the  Juster  data  are  displayed  in  Table  7. 
In  all  four  cases  the  null  hypothesis  cannot  be  rejected  at  the  1  per  cent 
levels.  We  conclude  that  the  linear  model  provides  a  superior  fit  to  the 
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2       2 
Juster  data  than  the  piecewise  linear  model.  The  improvement  in  R  (and  R^,  ) 

in  going  from  the  linear  to  the  piecewise  linear  model  is  small.  Similarly, 

the  reduction  in  standard  error  of  estimate  is  small. 


INSERT  TABLE  7  HERE 


We  also  carried  out  a  similar  comparison  of  the  linear  and  piecewise  linear 

models  for  the  Pickering  and  Isherwood  durable  goods  data.  The  results  are  similar 

2 
to  those  of  the  Juster  data  and  are  presented  in  Table  8.  The  x^     statistics  obtained 

from  the  likelihood  ratio  tests  are  not  significant  at  the  1  per  cent  level  in  any 

2       2 
of  the  ten  cases.  The  improvements  in  R  (and  R^^  )  and  the  reductions  in  the 

standard  error  of  estimate  are  small.  From  all  these  findings,  we  conclude 

that  for  durable  goods,  the  linear  model  constitutes  a  reasonable  hypothesis 

for  relating  purchase  intentions  to  purchase  probabilities. 


INSERT  TABLE  8  HERE 


Packaged  Goods  Results 

We  report  results  for  five  packaged  goods.  The  sources  of  the  data  investigated 
here  are  listed  in  Table  3  which  also  provides  some  descriptive  information 
about  the  studies  from  which  they  were  obtained.  Table  9  displays  the  para- 
meter estimates  along  with  the  four  performance  measures  for  the  five 
packaged  goods. 


INSERT  TABLE  9  HERE 


The  estimates  of  the  beta  binomial  model  parameters  a  and  6,  as  in  the 
case  of  durable  goods,  were  obtained  from  the  stated  intention  frequency  data. 
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N  's.  Table  10  reports  the  actual  and  predicted  intention  frequencies  (rounded 
to  the  nearest  integer)  for  the  five  packaged  goods.  Predicted  frequencies 
were  obtained  from  a  and  6  .  Examination  of  the  results  in  Table  10  reveals 

that  the  beta  binomial  model  does  not  provide  a  fit  of  comparable  quality  to 

2 
that  obtained  for  the  durable  goods  data.  In  three  of  the  five  cases,  the  x 

statistics  are  greater  than  the  critical  values  at  the  1  per  cent  significance 
level.  In  the  case  of  Products  B  and  C  from  Tauber  (1975),  however,  the  fit  of 
the  beta  binomial  model  is  reasonably  good.  In  summary,  the  evidence  for  pack- 
aged goods  presented  here  with  regard  to  the  beta  binomial  hypothesis  is  inconclusive. 


INSERT  TABLE  10  HERE 


2 
The  xi  values  from  the  likelihood  ratio  test  comparing  the  fit  of  the 

linear  model  and  the  null  model,  P  =  A,  in  Table  9  all  exceed  6.635  and,  there- 
fore, the  null  hypotheis  that  the  predicted  purchase  probabilities  are  independent 

2 
of  stated  intentions  is  rejected  at  the  1  per  cent  level.  The  values  of  R  are 

reasonably  high  suggesting  that  the  linear  model  provides  a  fairly  good  fit  for 

the  packaged  goods  data. 

Table  11  displays  parameter  estimates  along  with  the  performance  measures 
for  the  piecewise  linear  model.  The  chi -square  statistic  has  2  degrees  of 
freedom  and  is  obtained  from  the  likelihood  ratio  test  with  the  linear  model 
as  the  null  hypothesis  and  the  piecewise  linear  model  with  a  threshold  effect 
(see  Figure  3)  forms  a  reasonable  alternative  hypothesis  for  obtaining  purchase 
probabilities  from  stated  intentions. 
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INSERT  TABLE  11  HERE 

The  numerical  values  of  the  estimates  of  A,  B^ ,  I*,  and  B^  in  the  piece- 
wise  linear  model  are  interesting.  Note  that  the  maximum  likelihood  estimate 
of  the  threshold,  I*,  was  obtained  from  the  optimization  algorithm.  In  other 
words,  maximum  likelihood  estimates  of  all  four  parameters  were  estimated 
simultaneously  (Kalwani  1980).  The  estimate  of  slope  1  (B^)  is  considerably 
smaller  than  the  estimate  of  slope  2  (B2).  In  three  of  the  five  cases,  B^  is 
equal  to  zero.  The  estimate  of  the  intercept  is  also  close  to  zero,  which 
indicates  that  in  the  case  of  packaged  goods,  the  predicted  purchase  probability 
of  those  with  stated  intentions  less  than  the  threshold  is  close  to  zero. 
Perhaps  then,  for  these  respondents  the  branded  packaged  good  is  part  of 
their  evoked  set  but  they  are  not  seriously  considering  buying  the  brand.  It 
bears  noting  that  in  four  of  the  five  cases  (the  exception  is  Product  B  from 
Tauber  (1975)),  only  those  with  stated  intentions  of  (n-l)/n  and  n/n  have  a 
significant  probability  of  purchasing  the  product.  This  finding  lends  support 
to  the  rule-of-thumb  frequently  employed  in  analyzing  concept  tests  wherein 
only  the  responses  for  the  "top  two  boxes"  of  5-point  intention  scales  are 
used  as  a  basis  for  concept  selection. 

DISCUSSION 
The  major  finding  that  emerges  from  the  results  presented  in  the  preceding 
section  is  that  the  relationship  between  expressed  intentions  and  subsequent 
purchase  behavior  appears  to  be  different  for  generic  durables  as  compared  to 
branded  packaged  goods.  Whereas  intentions  and  purchasing  were  found  to  be 
linearly  related  for  durable  goods,  the  packaged  goods  data  exhibited  a  threshold 
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effect  and  the  slope  of  the  intention-purchase  relationship  was  not  constant 
For  the  entire  set  of  durable  goods  investigated,  the  linear  .odel  provided 
a  superior  fit  to  the  data  than  either  the  null-«del  that  purchase  proba- 
bilities are  independent  of  purchase  intentions  or  the  piecewise  linear  model 
In  contrast,  for  packaged  goods  th.  linear  .odel  gave  an  inferior  fit  to  that 
Obtained  with  the  piecewise  linear  .odel .  Moreover,  the  estimates  of  the  inter- 
cept  and  slope  1  in  the  piecewise  «del  were  found  to  be  close  to  zero  indicat- 
ing that  respondents  with  intentions  below  the  threshold  level  are  unlikely  to 
buy  branded  packaged  goods.  The  t-statistics  for  the  parameter  estimates  and 
several  other  performance  measures  all  indicated  that  the  linear  model  fitted 
the  durable  goods  data  quite  well  as  did  the  piecewise  linear  model  for  the 
packaged  good  brands. 

It  Should  be  mentioned  that  a  threshold  effect  also  appears  in  results 
reported  by  Haley  and  Case  (1979.  p.  30)  and  Laroche  and  Howard  (1980.  pp.  382 
and  384)  su^.arizing  concurrent  (as  opposed  to  predictive)  relationships  bet- 
ween intentions  and  purchasing  for  packaged  good  brands.  Laroche  and  Howard 
^ployed  a  five  point  intention  scale  while  Haley  and  Cases's  data  show  a 
threshold  effect  for  both  the  five  point  intention  scale  and  the  eleven  point 
purchase  probability  scale  they  included  in  their  study.  The  existence  of  a 
threshold  effect  is  consistent  with  the  "top  box"  rule  sometimes  used  to  score 
intention  ratings  whereby  only  respondents  who  check  one  of  the  two  highest 
response  categories  are  counted  in  computing  a  criterion  measure  in  concept 
tests  and  related  types  of  studies  (Gold  and  Salkind  1974).  As  has  been  pre- 
viously suggest  by  Gold  and  SaUind  (1974).  a  likely  explanation  for  the  thresh- 
old effect  is  yeasaying  response  style  bias.  Another  not  necessarily  unrelated 
interpretation  suggested  by  Tversky  and  Kahnman's  (1974)  concept  of  how  bias 
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arises  in  judgments  due  to  reliance  on  an  "availability"  heuristic  would  be 
that  recency  and/or  frequency  of  purchase  plus  telescoping  memory  error  may 
be  the  source  of  bias  in  intention  ratings  for  packaged  goods. 

In  line  with  our  earlier  discussion  of  bias,  the  results  obtained  with 
the  linear  model  suggest  that  the  systemic  error  in  individual  intentions  for 
generic  durables  may  arise  from  different  sources  than  for  brands  of  frequently 
purchased  products.  Referring  to  the  results  for  the  Pickering  and  Isherwood 
data  summarized  in  Table  6,  we  see  the  estimates  of  the  bias  parameter,  b, 
varied  in  both  sign  and  magnitude:  b  was  positive  for  7  of  the  10  durables 
and  ranged  in  absolute  value  from  .009  to  .065.  These  results  plus  the  vari- 
ability in  both  the  beta  binomial  parameters  suggest  more  discrimination  among 
the  different  durables  than  would  be  excepted  if  a  large  component  of  response 
style  bias  was  consistently  present  in  the  intention  ratings  for  this  entire 
set  of  durables.  Recall  that  the  same  sample  of  respondents  rated  all  10  dur- 
ables. On  the  other  hand,  we  observe  from  Table  6  that  the  change  parameter, 
p,  was  generally  quite  high,  ranging  from  .298  for  new  automobiles  to  .881  for 
black  and  white  television.   All  but  two  of  the  estimates  of  p  exceeded  .5 
implying  that  for  most  of  these  durables,  half  or  more  of  the  respondents 
underwent  a  change  in  true  intentions  at  some  point  in  the  12  month  period 
covered  by  the  study. 

The  beta  binomial  model  was  consistently  found  to  fit  the  marginal  dis- 
tribution of  intentions  ratings  quite  well  for  a  broad  spectrum  of  generic 
durable  goods  but  this  was  not  the  case  for  packaged  goods  where  the  scope  of 
our  investigation  was  restricted  to  a  handful  of  brands.  For  only  two  of  the 
five  sets  of  package  goods  data  examined  could  the  null  hypothesis  of  the  beta 
binomial  model  be  sustained.  However,  in  the  two  instances  where  the  fit  was 
clearly  poor  (Smith  et  aland  Tv/yman),  the  intentions  data  analyzed  consisted 
of  ratings  for  more  than  one  product  that  had  been  pooled.  The  poor  fits 
obtained  for  the  beta  binomial  model  may  be  the  result  of  aggregating  hetero- 
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geneous  underlying  distributions  of  the  individual  products.  Thus  we  must 
regard  the  evidence  reported  here  bearing  on  the  quality  of  the  fit  of 
packaged  goods  data  to  the  beta  binomial  model  as  inconclusive. 

It  should  be  mentioned  that  our  conclusions  pertaining  to  the  compara- 
tive performance  of  alternative  forms  of  the  intention-purchase  probability 
relationship  (independence  vs.  linear  vs.  piecewise  linear)  do  not  in  any 
way  rely  on  the  assumption  that  stated  intentions  follow  a  beta  binomial  dis- 
stribution.  As  discussed  in  Morrison  (1979,  pp.  67-68),  the  beta  binomial 
assumption  along  with  the  modifications  allowing  for  instability  and  bias 
simply  represent  behavioral  assumptions  which  are  consistent  with  the  hypothesis 
of  a  linear  intention-purchase  relationship.  Therefore,  the  poor  fit  of  the 
linear  model  in  the  case  of  branded  packaged  goods  may  not  be  attributed  to  the 
failure  of  the  stated  intentions  data  to  fit  the  beta  binomial  model. 

In  his  original  paper,  Morrison  (1979,  p.  72)  suggested  that  further 
empirical  applications  of  this  model  could  lead  to  the  discovery  of  systematic 
differences  in  the  properties  of  intentions  data  that  would  relate  to  the  types 
of  purchase  stimuli  for  which  intentions  were  being  measured.  Our  findings 
indicate  that  such  a  distinction  does  exist  between  generic  or  broadly  defined 
classes  of  durable  goods  purchased  infrequently  and  specific  brands  of  packaged 
goods  from  product  categories  where  purchases  occur  frequently  and  regularly. 
Needless  to  say,  such  a  comparison  represents  only  a  modest  beginning  in 
developing  an  understanding  of  the  properties  of  intention  scales  that  is 
needed  to  address  the  issues  pertaining  to  their  usefulness  that  arise  in 
practice  when  heavy  reliance  is  placed  on  an  intention  scale  as  the  key  response 
or  criterion  measure.  Our  search  of  the  published  literature  turned  up  no  de- 
tailed evidence  relating  to  the  reliability  and/or  predictive  validity  of  inten- 
tions measures  for  specific  brands  of  durables.  Similarly,  other  critical  dis- 
tinctions such  as  that  between  new  versus  established  brands  and  concepts  versus 
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existing  products  remain  to  be  investigated  as  does  the  issue  of  parameter 
stationarity  over  time  to  which  Morrison  (1979,  p.  73)  drew  attention. 
Doubtless,  experience  and  data  relevant  to  some  or  all  of  these  questions 
has  accumulated  in  the  marketing  research  community  and  this  paper  will  have 
served  a  valuable  purpose  if  it  stimulates  further  applications  of  Morrison's 
model  and  disclosure  of  the  results. 
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SUMTIARY  AND  CONCLUSIONS 

In  this  paper  we  have  shown  how  Morrison's  model  can  serve  as  a  valuable 
framework  for  organizing  existing  evidence  bearing  on  the  key  psychometric 
properties  of  intention  scales:  unreliability,  bias,  and  instability.  Each 
of  these  qualities  represents  a  potential  threat  to  the  predictive  validity 
of  intentions  measures  and  is  explicitly  represented  in  the  model.  Our  review 
of  previously  published  findings  plus  the  results  of  the  analysis  reported 
here  may  be  summarized  as  follows: 

1.  Reliability 

Using  Morrison's  assumptions  that  stated  intentions  follow  a 
beta  binomial  distribution,  we  showed  that  reliability  is  expected 
to  vary  according  to  the  heterogenerity  of  true  intentions  among 
respondents  and  the  number  of  scale  points  included  in  the  response 
scale.  Findings  presented  from  a  small  scale  study  of  a  seven  point 
instrument  indicated  that  the  test-retest  correlations  for  inten-r 
tion  ratings  of  concept  stimuli  were  consistent  with  the  levels  of 
reliability  expected  under  the  beta  binomial  assumptions.  There 
is  a  conspicuous  absence  of  other  studies  of  the  reliability  of 
intention  scales  in  the  marketing  research  literature. 

2.  Bias,  Instability,  and  Predictive  Validity 

Evidence  of  the  predictive  validity  of  intention  ratings  was 
obtained  from   re-analyses  of  several  sets  of  previously  pub- 
lished data  but  different  forms  of  predictive  relationships  were 
found  for  generic  durable  goods  and  branded  packaged  goods.  There 
is  reason  to  believe  that  the  sources  of  systematic  error  present 
in  intention  ratings  are  different  for  these  two  types  of  pur- 
chases. 

a.  Generic  Durable-Good  Purchases 

Consistent  with  Morrison's  model,  stated  intentions  were 
found  to  fit  beta  binomial  distributions  and  be  linearly  re- 
lated to  purchase  probability  for  a  broad  spectrum  of  durable 
goods  purchases.  The  estimated  value  of  the  change  para- 
meter was  generally  quite  high  and  implied  that  a  very   sub- 
stantial fraction  of  respondents  experienced  a  change  in  true 
intention  over  a  twelve  month  period.  The  magnitude  of  bias 
was  generally  positive  but  varied  in  magnitude  over  a  limited 
range.  The  assumption  of  constant  bias  across  different  levels 
of  intention  also  appeared  tenable. 

b.  Branded  Packaged  Good  Purchases 

A  threshold  effect  appears  to  characterize  the  intention- 
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purchase  relationship  here  and  a  piecewise  linear  model  was 
found  to  provide  a  good  fit  to  the  limited  number  of  data 
sets  investigated.  The  probability  of  purchase  appears  to 
be  quite  small  for  intentions  below  the  threshold  level  and 
is  consistent  with  the  "top  box"  scoring  rule  frequently 
applied  to  intention  ratings  for  packaged  goods  brands  and 
concepts.  Response  style  and  judgment  heuristics  were 
suggested  as  likely  explanations  of  the  threshold  effect. 
Results  obtained  pertaining  to  the  fit  of  beta  binomial 
model  were  inconclusive. 

3.  Scale  Design 

There  are  reasons  for  hypothesizing  that  increasing  the  num- 
ber of  scale  points  or  response  alternatives  included  in  an  inten- 
tion instrument  may  increase  both  reliability  and  response  bias. 
There  is  insufficient  empirical  evidence  to  evaluate  these  effects 
and  thus  resolution  of  the  issue  of  optimal  scale  length  awaits 
further  study. 

These  findings  provide  some  reinforcement  for  Morrison's  (1979,  p.  73) 

belief  that  "empirical  regularities'  would  emerge  from  further  studies  of 

intentions  data.  Hopefully  the  methods  and  results  discussed  here  will 

encourage  further  applications  of  Morrison's  model  to  expand  the  scope  of 

present  knowledge  about  this  widely  used  measure. 
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APPENDIX  A 

In  this  appendix  we  obtain  an  expression  for  the  two  variance  components 
of  a  beta  binomial  distributed  intention  scale,  namely,  the  variance  of  the  condi- 
tional mean  and  the  expected  value  of  the  conditional  variance.  The  first  variance 
component  is  given  by: 

(Al) 


Varll^]  =/  I^fHt^^^  -  ^^[1^]}' 


where  f{lf)   is  the  beta  density  function  with  parameters  a  and  b.  Upon  integration, 
equation  (Al)  leads  to: 


'^^^h^-w4CU-{^) 


(a+B)^(a+3+l) 


(A2) 


The  second  variance  term  representing  the  expectation  of  the  conditional  variance 
is  given  by: 

Pt^-It^ 


E[var[4t]]   -[    \'  ^    ^(^)dl^       .  (A3) 

Dn,  equatio 

[var[e|I^]J 

(A2)  and  (, 

Var[I^]  =  Var[-I^]  +    e  [varCel   I^jj 

aB  ag 

^    (a+6)2(a+6-Hl)      '      "("-^^HaW) 

_        cxB(a+B+n) 

T, -  (A5) 

n(a+B)      a+B+1 


0 
Upon  integration,  equation  (A3)  becomes: 

E  Var[e|I^]   =   aB/n(a+B) (a+B+1)         .  (A4) 

From  equations   (A2)  and  (A4),  the  total   variance  of  I^  is  given  by: 
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APPENDIX  B 

The  purpose  of  this  appendix  is  to  derive  an  expression  for  the  linear  rela- 
tionship between  the  expected  value  of  true  intention  and  the  stated  intention.  The 
random  variable,  I  ,  representing  the  stated  intention  is  distributed  betarbinomial 
such  that: 


lVn\  X      n-x 


<:) 


n\B(a+x;e+n-x)  jdh 

xj  8^.3)      '  ^     ' 


where  BC*)  denotes  the  beta  function.  The  conditional  distribution  of  I^  for 


given  I^  is: 


P(^tlV  =  ^^   P(I) '  ^''^ 


X 

From  CB2)  we  obtain: 


Et^tiy  =Xv(itiix)^k 


=^i.£)M-/-^^<'*'^'t 


B(a+x+l;B+n-x) 
B(a+x,B+n-x) 

• 

cation  we  obtain: 

rn   It  n          ^^^ 

'-J-^t'^x-'  -   (cx+B+n) 

-  /    '^     ^  1  /"     " 

Vx 

\a+B+ni      U+B+n 

(83) 


since  x  =  nl^. 
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FOOTNOTES 


1.  Intention  is  also  a  key  construct  in  models  of  the  relationship  between 
attitudes  and  behavior.  Recent  discussions  of  this  work  in  social 
psychology  and  marketing  may  be  found  in  Bagozzi  (1981)  and  Warshaw  (1980). 

2.  Keats  and  Lord  (1962,  p.  61)  show  that  the  reliability  of  beta  binomial 
distributed  scales  (11)  is  equivalent  to  the  Kuder  Richardson  Formula  21 
(Lord  and  Novick  1968,  pp.  91-92  and  523-524)  used  in  psychometrics  to 
assess  the  reliability  of  composite  scores. 

3.  Another  source  of  systematic  error  in  the  predictive  ability  of  measured 
intentions  that  deserves  mention  here  is  that  which  arises  when  purchases 
are  the  outcome  of  a  family  decision  process,  which  is  not  reflected  in 
the  intention  measures  obtained  from  an  individual  family  member.  Davis 
and  Ragsdale  (1979)  and  Granbois  and  Summers  (1975)  have  reported  low  to 
moderate  levels  of  correlation  between  purchase  intention  measures  obtained 
from  husbands  and  wives  within  the  same  households. 
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FIGURE  Z 


TRACE  LINES  FOR  DIFFERENT  RESPONSE 
CATEGORIES  OF  A  5  POINT  INTENTION  SCALE 


Binomial  Probability  of  Endorsing 
Response  Category  x 
of  a  n  +1  point  scale 
P(X;It,h-4) 


True  Intention 
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Figure  3 


EXPECTED  RELIABILITY  AND  OBSERVED  TEST-RETEST 

CORRELATION  AS  A  FUNCTION  OF  a+6  FOR  A 

SEVEN  POINT  INTENTION  SCALE 


uu._ 
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FIGURE  4 
PIECEWISE  LINEAR  MODEL  WITH  A  THRESHOLD  AT  STATED  INTENTION,  l' 


Purchase 
Probability 
P. 


1.0 


0.5 


intercept  (A)l 


Threshold 


Stated  Intention,  I 


Slope  2  (^2) 
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PREDICTED  VS.  ACTUAL  PURCHASE  PROBABILITIES  FOR  OUSTER  DATA 


1.0 


Figure  5:  6  month  intentions  vs. 

6  month  purchases  of  automobiles 


Figure  6:  12  month  intentions  vs. 

6  month  purchases  of  automobiles 


1.0 


.0 


Ix- 


Figure  7:  12  month  intentions  vs. 

6  month  purchases  of  household 
appliances 


Figure  8:  24  month  intentions  vs. 

6  month  purchases  of  household 
appliances 
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TABLE  1 


BETA  BINOMIAL  PARAMETER  ESTIMATES,  EXPECTED  RELIABILITY 

AND  OBSERVED  TEST-RETEST  CORRELATION  FOR  RATING  OF 

TWENTY  CONCEPTS  OF  A  SEVEN  POINT  INTENTION  SCALE 


Beta  Binomial    Para-           Goodness-of-     | 

meter  Estimates   for 

Fit  Test  for 

Ratings  Obtained  on 

Beta  Binomial 

Expected 

Observed 

Concept 

First  Test* 

Distributiont 
X^           (df=4) 

Reliability 
(n/a+e+n) 

Test-Retest 
Correlation 

a 

B 

Energy-Saving  Window 

.543(.097) 

.378(.069) 

5.857 

.867 

.737 

Air  Conditioner 

Auto-Train  Service 

.754(.126) 

.626(.106) 

6.815 

.813 

.729 

Do-it-Yourself  Plumbing 

.895(.141) 

1.958(.173) 

5.320 

.754 

.700 

Fixtures 

Large  Screen  Color  Tele- 

1.115(.177) 

.942(.149) 

5.155 

.745 

.840 

vision  via  Closed  Circuit 

TV 

Investment  Counseling 

1.135(          ) 

1.030(          ) 

13.941 

.735 

.663 

Easy-to-Install   Sauna 

1.262(.195) 

1.063(.167) 

6.140 

.721 

.596 

Central   Home  Vacuum 

1.121(.174) 

1.229(.192) 

6.078 

.719 

.743 

Cleaner 

Natural    Food  Candy  Bar 

1.135(.172) 

1.317(.207) 

6.002 

.710 

.595 

Economy  Airfare 

.720(.116) 

1.860(.329) 

8.086 

.699 

.642 

(Limited  Services) 

Disposable  Fashion 

1.988(.362) 

.607(.106) 

4.884 

.698 

.507 

Raincoat 

Electric  Powered 

1.045(.159) 

1.743(.277) 

4.980 

.683 

.758 

Automobile 

Automatic  Bank  Tellers 

.975{.151) 

1.908(.308) 

9.546 

.675 

.754 

in  Supermarkets 

Combination  Clothes 

1.439(.211) 

1.778(.266) 

11.916 

.651 

.559 

Washer  and  Dryer 

Art  Rental   Service 

2.220(.341) 

1.197(.180) 

2.602 

.637 

.705 

Home  Burglar  Alarm 

1.620(.231) 

2.18G(.319) 

3.955 

.612 

.732 

System 

Pop-Top  Cans   for  Soups, 

2.016 

1.820 

13.638 

.610 

.663 

Vegetables,  etc. 

Hands-Free  Telephone 

1.762(.249) 

2.220(.321) 

14.482 

.601 

.709 

Home-Shopping  Service  via 

2.034(.292) 

2.568(.359) 

0.917 

.566 

.506 

Telephone  Connected  TV 

Disposable  Cheese-Board  & 

2.741 (.376) 

2.247(.309) 

5.727 

.546 

.510 

Dinner  Cheese  Assortment 

Video-Tape  Unit  for  Home 

2.623(.347) 

3.871(.521) 

10.590 

.480 

.722 

Education 

*The  figures   in  parenthes 

es  are  the  e 

stimated  sta 

ndard  errors  of 

a  and   g. 

.10" 


7.779;   X' 


.05 


chi- 
9.488 


square  statistic   (df=4)   are: 
;   and  x\yi    =   13.277. 
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Table  Z 


mO'ik   SUMMARY  FOR  INTENTION  RATINGS  OF  20  CONCEPTS 
BY  63  RESPONDENTS  ON  2  OCCASIONS 


Source  of  Variation 

df 

Mean 
Square 

Estimate  of 
Variance 
Component 

Proportion 

of 

Total  Variance 

Respondents 

62 

25.575 

0.612 

.155 

Concepts 

19 

50.596 

0.364 

.092 

Test  Occasions 

1 

1.625 

0 

0 

Interactions: 
Respondents  x  Concepts 
Respondents  x  Test  Occasions 
Concepts  x  Test  Occasions 

1178 
62 
19 

4.752 
1.900 
1.756 

1.826 

0.040 
0.010 

.462 
.010 
.002 

Residual 

1178 

1.101 

1.101 

.279 

Totals 

2519 

3.953 

1.000 

i     ^ 


^.5 

T3 

E    0) 

It 

q:i/) 

276/product 
(same  sample 
for  all  18 
products) 

17  products 
X  97  respon- 
dents/product= 
1,649  (data 
pooled  across 
products) 

734  (data 
pooled  for  2 
products,  same 
respondents) 

213/product 
(same  sample 
for  all  3 
products) 

0) 

J- 

3 

tn 
CO 

1 

Purchases 
reported  in  mail 
survey  conducted 
14  months  later 

Purchases  reported 
in  telephone  inter- 
view conducted  11 
days  later 

Purchases  reported 
in  diaries  kept  for 
1  week  following 
ad  test 

Purchase  made  over 
a  4  month  period 
when  given  home 
buying  opportunity 

3 

in 
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0) 

E 
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■)-> 
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Q. 
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Si 

E 
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Ratings  obtained 
following 
exposure  to  com- 
mercials 

Personal  inter- 
view after 
trial  use  in 
home  placement 
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la 
i/> 

11  point 
purchase  prob- 
ability (12 
month  time 
horizon) 

o 
+j      to 

E 

•1-      in 
§.    1 
in      Q. 

7  point 

purchase 

intention 

5  point 

purchase 

intention 

-> 

3 
O 

3. 

0) 

3 

■o    ^ 

o    i<i 

0)     m 
en    -o 

^     1 

17  name  brand  prod- 
ucts and  services 
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2  branded 
packaged  goods 
(U.K.) 

o 

CO 
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Table  5 

OBSERVED  AND  PREDICTED  STATED  INTENTION  FREQUENCIES 
FOR  OUSTER  DATA 


PRODUCT 

^x 

Autos   I 

Autos   II 

H/H 
Durables   I 

H/H 
Durables   II 

0.0 

345     (345)* 

293     (292) 

2377     (2377) 

2174     (2174) 

0.1 

29       (27) 

26       (33) 

87         (85) 

95       (113) 

0.2 

16       (15) 

21        (20) 

57          (46) 

99         (63) 

0.3 

14       (11) 

21        (15) 

29         (32) 

41          (46) 

0.4 

3         (9) 

10       (12) 

23         (26) 

38         (38) 

0.5 

10         (8) 

9        (11) 

22         (22) 

28         (33) 

0.6 

6         (7) 

12        (10) 

21          (20) 

30         (31) 

0.7 

6         (6) 

13        (10) 

14         (18) 

29          (31) 

0.8 

5          (6) 

11        (11) 

11          (18) 

36         (33) 

0.9 

5          (7) 

10       (12) 

17         (19) 

24         (39) 

1.0 

11          (9) 

21        (21) 

30         (25) 

80         (72) 

2 
Computed  Xq 

6.656 

6.937 

8.433 

32.270 

''The  figures  in  parentheses  are  predicted  frequencies  rounded  to  the 
nearest  integer. 
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Table  8 
RESULTS  FOR  PICKERING  AND  ISHERWOOD  DATA:  PIECEWISE  LINEAR  MODEL 


Product 

Piecewise  Linear  Model 
Parameters 

2 

X2 

r2 

Standard 
Error  of 
Estimate, 
S 

9 

Intercept 
A 

Slope  1 
Bl 

Threshold 
Ix 

Slope  2 
^2 

^S 

(SE(A)) 

(SE(B^)) 

(SE(I*)) 

(SE(B2)) 

Furniture 

.201 
(.026) 

.632 
(.173) 

.300 
(.041) 

.109 
(.115) 

1.185 

.745 

.063 

.047 

Used  Cars 

.130 
(.021) 

.357 
(.115) 

.600 
(.041) 

1.068 
(.284) 

1.543 

.876 

.059 

.126 

New  Cars 

.077 
(.017) 

.000 

(  — ) 

.300 
(.023) 

1.079 
(.121) 

5.673 

.901 

.068 

.266 

Washing 
Machines 

.049 
(.014) 

.000 

(  — ) 

.200 
(.014) 

.707 
(.157) 

4.198 

.628 

.101 

.188 

Record 
Players 

.067 
(.016) 

.000 

( - ) 

.300 
(.032) 

.421 
(.178) 

1.659 

.490 

.064 

.055 

Vacuum 
Cleaners 

.051 
(.014) 

.000 

(  — ) 

.307 
(.120) 

.461 
(.169) 

1.875 

.659 

.053 

.087 

Refrigerators 

.048 
(.013) 

.000 

(  — ) 

.312 
(.092) 

.853 
(.259) 

4.838 

.720 

.060 

.134 

Color  TV's 

.043 
(.012) 

.000 

( - ) 

.200 
(.024) 

.561 
(.235) 

1.932 

.500 

.061 

.086 

Cookers 

.040 
(.012) 

.000 

( - ) 

.400 
(.044) 

.594 
(.234) 

2.572 

.852 

.030 

.108 

Black  &  White 
TV's 

.043 
(.012) 

.000 

( - ) 

.500 
(.039) 

.494 
(.269) 

2.951 

.591 

.035 

.054 

Notes 

1.  The  date  in  each  case  represent  twelve  month  intentions  vs.  fourteen  month  purchases. 

2.  Sample  size  is  276  in  each  case. 

3.  Intentions  are  obtained  on  eleven  point  scales  ranging  from  0  to  10. 
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Table  10 

OBSERVED  AND  PREDICTED  STATED  INTENTION  FREQUENCIES 
FOR  PACKAGED  GOODS  DATA 


Data 

Source 

X 

Smith,  et.  al.  (1963) 

Twyman 
(1973) 

Tauber  (1975) 
Product  A 

Tauber  (1975) 
Product  B 

Tauber  (1975) 
Product  C 

0 

709  (583) 

165  (142) 

64  (60) 

49  (44) 

28  (26) 

1 

165  (242) 

39   (90) 

25  (35) 

28  (39) 

23  (31) 

2 

99  (192) 

66   (79) 

26  (31) 

36  (39) 

45  (37) 

3 

396  (200) 

90   (77) 

51   (44) 

55  (41) 

45  (47) 

4 

297  (349) 

102   (81) 

47  (52) 

45  (50) 

72  (72) 

5 

119   (97) 

6 

153  (168) 

Computed  x 

269.688 

48.677 

13.547 

9.170 

4.059 

Critical  x^ 

9.210 

13.277 

9.210 

9.210 

9.210 

''The  figures  in  parentheses  are  predicted  frequencies  rounded  to  the  nearest  integer. 
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